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(57) Abstract 



The regulation of gene expression is of fundamental importance to all biological functions including adaptation to environmental 
conditions, cell division and differentiation, and the development of disease states such as cancer. The application of "reporter" gene 
systems, which allow changes in gene expression to be assayed quickly and easily, has contributed greatly to our understanding of the 
mechanisms involved in gene regulation. A reporter gene system comprising a gene encoding an tv-amylase enzyme for use in transforming 
mammalian cells is described. In a further example the reporter gene system comprises a second a— amylase gene, which codes for an 
enzyme with an electrophoretically distinct mobility from that of the first a-amylase. The two genes are used for simultaneous expression in 
mammalian cells, with one isozyme acting as an internal control, and the other isozyme acting as the indicator gene. The use of « -amylase as 
a secreted reporter gene is particularly attractive because the enzyme is extremely stable over a wide range of conditions. The measurement 
of » -amylase activity is simple, quantitative, sensitive, safe and inexpensive, and the range of electrophoretically distinguishable variants 
available allows the assays to be performed in virtually any host without interference from endogenous amylase activities. 
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Secreted a- Amylase As A Reporter Gene 



The present invention relates to a secreted reporter gene system. More 
specifically, the present invention relates to a secreted reporter gene system based on 
5 one or more a-amylases, which are electrophoretically distinct variants of each other, 
for simultaneous expression in mammalian cells. 

BACKGROUND OF THE INVENTION 

Reporter genes come in two basic varieties; those that encode intracellular 
10 enzymes, and those that code for secreted reporter proteins. Most reporter genes 
currently in use are of the intracellular variety that require manipulation of the cells, 
normally harvesting and lysis, prior to measurement of the reporter activity. In 
contrast, cells transformed with constructs that encode secreted reporter enzymes do 
not require manipulation or lysis of the cells for activity measurements. Instead, 
15 aliquots of the extracellular medium are harvested and assayed directly for enzyme 
activity. Thus, secreted reporter genes can be used to monitor gene expression in 
vitro, in vivo and ex vivo. 

There are a number of commercially available reporter genes that encode 
20 intracellular enzymes, e.g. green fluorescent protein (GFP; Chalfie et aL, 1994, 
Science 263: 802-805 ), luciferase (de Wet et aL 7 1987, Mol. Cell. Biol. 7: 725-737), 
chloramphenicol acetyltransferase (CAT; Gorman et aL, 1982, Mol. Cell. Biol. 2: 
1044-1051). However, none of these intracellular enzymes are suitable as a secreted 
reporter activity. For instance, the luciferase reporter gene is used widely due mainly 
25 to the sensitivity of the detection system and the lack of background activity in most 
biological systems. However, the luciferase enzyme is relatively unstable under any 
conditions, and is completely and irreversibly inactivated by the process of secretion. 

Secretable human placental alkaline phosphatase (SEAP); a truncated form of 
30 membrane-bound alkaline phosphatase; (Berger, J., Hauber, J., Hauber, R., Geiger, 
R., Cullen, B.R. 1988. Gene. 66: 1-10; Cullen, B.R., and Malim, M.H. 1992. 
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Methods Enzymol. 216: 362-368) is a reporter gene that is distributed by.Clontech as 
a secreted reporter gene/enzyme system (see European Patent application 0 327 960). 
However, many biological systems display endogenous (background) alkaline 
phosphatase (AP) activity. For example, the high level of alkaline phosphatase activity 

5 in cows 1 milk precludes the use of SEAP as a reporter activity for experiments aimed 
at using the bovine udder as a "bioreactor" in the production of bioengineered milk. 
A second commercially-available, secreted reporter gene is human growth hormone 
(hGH, Boehringer Mannheim; Selden, R.F., Howie, K., Rowe, M.E., Goodman, H., 
Moore, D. 1986, Mol. Cell. Biol. 6: 3173-3179). However, since hGH has no easily 

10 measurable biological activity, the detection system for this secreted product uses a 
laborious, indirect antibody-based ELISA method. 

Thus, there is a need for an improved secreted reporter gene that overcomes the 
problems of the prior art systems. Specifically there is a need for a secreted reporter 
15 gene encoding a biological activity that, unlike hGH, is safe and easily measured, but 
that does not have the background activity problems associated with SEAP. 

SUMMARY OF THE INVENTION 

Thus according to the present invention there is provided a secreted reporter 
20 gene system for mammalian cells. More specifically the present invention relates to 
a secreted reporter gene system based on one or more variants of a-amylase, that have 
different electrophoretic mobilities from that of mammalian a-amylases, and that can 
be efficiently expressed and secreted by mammalian cells. 

25 The reporter gene system of the present invention comprises a DNA sequence 

with the following components: 1) a signal peptide coding region; 2) a sequence 
encoding one or more variants of the a-amylase mature protein; and 3) a transcription 
termination region. 

30 In one embodiment of the present invention, the a-amylase gene is the chicken 

(Gallits gallus) a-amylase gene. 



WO 98/49320 PCT/CA98/00157 

3 

In one embodiment of the present invention, the a-amylase gene is the fruit fly 
(Drosophila melanogaster) a-amylase gene. 

In one embodiment of the present invention, the reporter gene system comprises 
5 a DNA sequence based on the a-amylase gene from the chicken (Gallus gallus), in 
conjunction with the a-amylase gene from the fruit fly {Drosophila melanogaster), in a 
dual-reporter plasmid. 

In a further embodiment of the present invention, the signal sequence is based on 
10 the Drosophila melanogaster a-amylase secretion signal. 

In yet a further embodiment of the present invention the transcription termination 
region is based on the Drosophila melanogaster a-amylase gene termination region. 

15 In a further aspect of the present invention the reporter gene system also includes 

a transcription promoter region. 

The present invention is also directed to a vector comprising the reporter gene 

system. 

20 

The present invention also defines a procedure to assay for a-amylase activity in 
complex biological mixtures of proteins. 

In yet a further embodiment of the present invention, there is provided a plate 
25 diffusion assay, which is used to measure a-amylase activity from a large number of 
samples. 



30 



Also included within the scope of the present invention is a kit including the 
reporter gene system of the present invention and reagents for testing a-amylase activity. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

These and other features of the invention will become more apparent from the 
following description in which reference is made to the appended drawings wherein: 
FIGURE 1 shows the sequence of the chicken a-amylase cDNA (SEQID No: 1). 
FIGURE 2 shows an example of the reporter gene construct which does not contain 
a promoter region. The a-amylase mature peptide coding sequence "amylase", 
and transcription termination region "tef are indicated. The solid black 
rectangle between the promoter and the amylase coding region represents the 
secretion signal peptide. Restriction enzyme recognition sites are as follows: 
As = Ascl ; Hi = //wdlll; Nc = NcoI\ Nh = Nhel; No = Noil. The 8-cutter 
sites Notl and Ascl were built into the upstream and downstream ends, 
respectively, of the reporter gene in order to facilitate its insertion into the 
reporter construct. 

FIGURE 3 shows an example of the reporter gene construct which contains a 
promoter region. The description is as above for Figure 2, except the 
transcription promoter 'pro' is also indicated. 

FIGURE 4 shows the sequence of the transcription termination region (SEQID No: 2). 

FIGURE 5 shows the electrophoretic mobilities of a-amylases from 3 different 
sources in a native gel assay. Lane 1, pig pancreatic amylase; Lane 2, D. 
melanogaster amylase; Lane 3, chicken pancreatic amylase. 

FIGURE 6 shows the production of the reporter activity by mammalian cells in 
culture. Lane 1, chicken pancreatic amylase control; Lane 2, mouse (PG13) 
cells; Lane 3, mouse (PA317) cells; Lane 4, hamster (CHO) cells; Lane 5, 
monkey (MA104) cells, Lane 6, cow (ET2) cells; Lane 7, human (Hela) cells; 
Lane 8, untrans formed mouse cells (PG13). 

FIGURE 7 shows the expression levels of amylase under different promoters. Lane 
1, medium from control, untransformed PA317 cells; Lane 2, cells transformed 
with reporter construct containing LTR:amy reporter gene; Lane 3, cells 
transformed with reporter contract containing Actimamy reporter gene; Lane 
4, cells transformed with reporter construct containing CMViamy reporter gene; 
Lane 5, chicken amylase standard. 
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FIGURE 8 shows the detection of amylase activity in cows' milk. Lane 1, whole 
milk; Lane 2, whole milk spiked with chicken amylase; Lane 3, empty; Lane 
4, chicken amylase standard; Lane 5 & 6, milk following removal of caseins; 
Lanes 7 & 8, milk with amylase spike following removal of caseins. 

FIGURE 9 shows the isolation and detection of cc-amylase in cows' pre-lactation 
secretions (pre-milk). Lanes 1, 2 & 3, early premilk from 3 quarters of cow 
#99. Lane 1, control quarter (RF-); Lane 2, treated quarter (RH + ); Lane 3, 
treated quarter (LH-h). Lanes 4, 5 & 6, late premilk from cow #99. Lane 4, 
treated quarter (LH+), Lane 5, treated quarter (RH+); Lane 6, control quarter 
(RF-). Lane 7, mixture of pig and chicken amylase standards; Lane 8, chicken 
amylase standard. 

FIGURE 10 is a schematic depiction of the plasmids used in the dual-reporter system. 
"Fly" refers to D. melanogaster, "chick" refers to G. gallus. Abreviations 
are: ss=signal sequence; amy = a-amylase; CMVp== immediate early promoter 
of the human cytomegalovirus; LpA=late polyadenylation site; ori = origin of 
replication; TB= transcriptional blocker; actin p = rat P-actin promoter. 

FIGURE 11 shows the production of a-amylase reporter activities by cultured 
mammalian cells transfected with either the pUC-TEST plasmid or the pU(S- 
CONTROL plasmid. Lane 1, chicken amylase standard; Lane 2, fly amylase 
standard; Lane 3, medium from control, untransformed PA317 cells, Lane 4, 
cells transformed with pUC-TEST=base dual reporter construct; Lane 5, cells 
transformed with pUC-CONTROL=dual reporter construct containing the actin 
promoter inserted upstream of the fly amylase reporter gene. 

FIGURE 12 shows the reporter activites of PA317 cells co-transformed with chicken 
a-amylase and the firefly luciferase. The reporter genes in each co- 
transformation were placed under the same promoter: the murine retroviral 
LTR, the rat p-actin promoter, or the human CMV early immediate promoter. 
Results for amylase activity were obtained by densitometric scanning of the 
photograph of the native gel containing the amylase activity obtained from the 
cell supernatant. Luciferase activity was assayed from cell lysates according 
to manufacturer's instructions (Promega Corp). 



WO 98/49320 PCT/CA98/001 57 

6 

DESCRIPTION OF PREFERRED EMBODIMENT 

The present invention relates to a secreted reporter gene system. More 
specifically, the present invention relates to a secreted reporter gene system based on 
one or more a-amylases, which are electrophoretically distinct variants of each other, 
5 and also distinct from mammalian a-amylases, for simultaneous expression in 



An a-amylase reporter gene encodes a biological activity that is easily measured 
by a variety of liquid or semi-solid phase assay systems. The use of a native gel assay 

10 system eliminates any potential interference by endogenous amylase activities produced 
by the transformed, mammalian target cells. In addition, the advantage of using a 
secreted reporter activity is that it is not necessary to harvest cells from the culture 
dish, or to biopsy a transformed tissue (e.g. udder epithelium), nor is it necessary to 
lyse cells prior to assay. Instead, the cell culture supernatant or biological fluid (e.g. 

15 milk) is recovered from the transformed culture or animal and assayed for reporter 
activity. This feature has several advantages such as the ability to conduct multiple 
measurements over time on a single transformed population of cells (time course 
measurements). 

20 Since a-amylase exists in a number of distinct isoforms, an additional 

advantage to using the amylase reporter system is that a different isozyme can be used 
as an internal control in transfection experiments. The "normalizing" isozyme reflects 
a standard level of gene expression, while the "test" isozyme is the indicator enzyme. 
Since the isozymes have different migration patterns on the native gel, they can be 

25 assayed simultaneously within the same gel lane. This simplifies the assay procedure, 
in contrast to other reporter systems where the test and normalizing activities require 
separate assays. 

The chicken and Drosophila a-amylase enzymes are two examples of suitable 
30 a-amylases, which can be used alone or together in both embodiments of the present 
invention. These enzymes are compatible with the mammalian cellular translation and 
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secretion machinery. Therefore, the mammalian cells are able to produce and secrete 
them efficiently in an authentic, bioactive form. Furthermore, these a-amylases have 
the same physio-chemical requirements for optimal activity and stability, for example, 
pH optimum, calcium activation, etc., as mammalian amylases. These features are 
5 essential for their use as reporter activities. In addition, the electrophoretic mobilities 
of the mature, active chicken and Drosophila a-amylase enzymes on native 
polyacrylamide gels are different from that of mammalian amylases. Therefore if any 
amylase enzyme is secreted from mammalian cells, the reporter activities can easily 
be differentiated from the background activity. 

10 

The absence of background activity is particularly important when measuring 
reporter activities in biological fluids, such as cows* milk, which contains high 
background levels for other reporter systems, e.g. the SEAP reporter system. 
According to the present invention, the a-amylase reporter system is mosLuseful in 
15 complex protein mixtures such as biological fluids, specifically fetal calf serum, milk, 
and pre-lactation secretions, because the reporter a-amylase activity . can be 
quantitatively recovered, using the isolation procedure included in this invention, from 
such fluids, which will be described in detail below. 

20 Included within the scope of the present invention are modified DNA sequences 

encoding functionally active a-amylases from chicken and Drosophila. Possible 
modifications include but are not limited to: 1) nucleotide substitutions that eliminate 
restriction enzyme (RE) sites in the naturally-occurring sequence, but do not alter the 
amino acid sequences, and the use of these RE sites in a multiple-cloning-region for 

25 the introduction of DNA components, e.g. the transcription promoter; 2) conservative 
amino acid substitutions; 3) the enhancement of the specific activity of the reporter 
polypeptide by amino acid substitutions; 4) increasing the temperature stability of the 
reporter enzyme by amino acid substitutions; 5) creation of hybrid reporter enzymes 
from portions of genes coding for different isozymes. Also included is the potential 

30 use of amylase inhibitors to specifically inhibit any endogenous, mammalian amylases 
that might be present in the extracellular medium of the transformed cells. 
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The reporter gene system of the present invention also includes a DNA 
sequence encoding a signal peptide coding region, which in one embodiment is based 
on the Drosophila melanogaster a-amylase pre -protein. This signal peptide works 
effectively in combination with the chicken a-amylase enzyme and the mammalian 
cellular secretion machinery. The DNA sequence encoding the signal peptide was 
modified to include useful restriction enzyme sites for the introduction of promoter 
sequences to drive the reporter gene. The present invention also encompasses any 
modifications of the signal peptide region that may increase the efficiency of the 
secretion process beyond the current level. 

The reporter gene system of the present invention also includes a transcription 
termination region. In one embodiment of the present invention the transcription 
terminator region is derived from the Drosophila melanogaster amylase gene. This 
portion of the reporter gene serves the following functions: (1) to provide a 3' non- 
coding region for the transcript; (2) to provide a transcription termination signal for 
the RNA polymerase complex; and (3) to provide a poly-adenylation motif to the 
primary transcript. In the example where two amylase genes are used in the reporter 
gene system and one of said genes is the D. melanogaster a-amylase gene, the 
terminator region comes from the late polyadenylation site of SV40, to minimize 
possible recombination with the a-amylase cassette. 

Included within the scope of the present invention is the addition of any 
functional DNA elements, such as transcription enhancer elements, introns, etc., to the 
transcription terminator region in order to enhance the overall level of expression of 
the reporter gene. 

In one embodiment of the present invention, the reporter gene system also 
includes a DNA sequence encoding a transcription promoter region. When the 
reporter gene system of the present invention includes a promoter, the primary use of 
this secreted reporter gene system is to monitor the efficiency of the introduction of 
foreign DNA into mammalian cells in vivo, in vitro and ex vivo. This application 
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facilitates experiments designed to optimize the conditions, methods and vehicles 
employed for the introduction of foreign DNA into mammalian cells. There are a 
number of high expression level, commercially available promoters which are suitable 
for this application. Not to be construed as limiting, possible examples include: a 
5 Cytomegalovirus promoter, a murine retroviral promoter and a rat beta-actin promoter. 

One aspect of the invention would be the use of a tissue-specific promoter to 
drive the expression of the reporter gene. This configuration would enable the 
researcher to distinguish between expression in target and non-target cell populations. 

10 Furthermore, it would allow researchers to optimize their delivery systems for 
targeting the foreign DNA to cell types that have the greatest capacity for producing 
and secreting foreign peptides. This aspect is especially important for in vivo gene 
transfer. Specific applications would include gene transfer to udder epithelial cells in 
mammals for the production of pharmaceutical agents into the milk (the somatic 

15 bioreactor concept), and DNA transfer for gene therapy in humans. 

The construction of the reporter gene involves the use of standard recombinant 
DNA methods, including the use of restriction enzymes to fuse DNA molecules witl* 
coherent ends, the amplification of DNA fragments using the polymerase chain reaction 

20 (PCR; Canadian patent 1,237,685), and the synthesis of DNA fragments using 
phosphoramidite chemistry (Matteucci and Caruthers, 1981, J. Am. Chem. Soc. 103: 
3185-3191). The application of the reporter gene involves the insertion of the gene 
into a suitable vector to form double-stranded, circular DNA molecules that are 
delivered to the eukaryote cells. General methods for the preparation and modification 

25 of recombinant DNA molecules have been described by Cohen et aL [U.S. patent No. 
4,237,224], Collins et aL [U.S. patent No. 4,304,863], Sambrook et aL (1989, 
Molecular cloning: A laboratory manual, Cold Spring Harbor Laboratory), and Mullis 
and Faloona (1987, Methods Enzymol. 155: 335-350) . Because they illustrate much 
of the state of the art, these references are hereby incorporated by reference. 



30 



WO 98/49320 PCT/CA98/00157 

10 

Once assembled, the vector carrying the reporter gene is delivered to the target 
cells by one of a number of commonly used methods, e.g. viral vectors, calcium 
phosphate co-precipitation, or liposome-mediated delivery. The DNA plasmid vector 
used to carry the reporter gene can provide for either transient or stable maintenance 
of the reporter gene in the target, host cells depending upon the type of vector 
employed. 

The production and secretion of proteins, including reporter enzymes, by 
eukaryotic cells is a multi-stage process. Briefly, the process begins with the 
transcription of the gene encoding the secreted product, followed by the processing of 
the primary transcript into mRNA, and the transport of the mature messenger into the 
cytoplasm. The mRNA is translated into the preprotein and simultaneously 
translocated into the first compartment of the secretory pathway by ribosomes 
associated with the endoplasmic reticulum. Targeting of the preprotein to the secretion 
pathway is mediated by a secretion signal peptide. Within the endoplasmic reticulum, 
golgi apparatus and the secretory vesicles, the primary translation product undergoes 
sorting and maturation to produce the final bioactive protein which is released into the 
extracellular medium. The present invention provides for novel reporter activities 
based on the chicken a-amylase gene and other electrophoretically distinct variants of 
a-amylase, which are compatible with and can be secreted efficiently by mammalian 
cells. The present invention also includes a secretion signal sequence that effectively 
targets the reporter peptide to the secretory machinery of mammalian producer cells. 

There are a number of prior reports on the expression and secretion of a- 
amylase in microbial systems. In addition, Japanese Abstract 63263086 described the 
use of a-amylase as an indicator gene in a "promoter trap" construct for use in both 
prokaryote and eukaryote systems. The present system however is preferred, in that 
this system is designed to measure gene expression and DNA delivery specifically in 
mammalian cells using electrophoretically distinguishable variants in combination with 
an assay system optimized for use with biological fluids. 
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The present invention also includes an isolation procedure whereby the reporter 
activity can be quantitatively recovered from complex protein mixtures such as 
biological fluids, specifically fetal calf serum, milk, and pre-lactation secretions. The 
isolation protocol, though based on the procedures of Vretblad, P., 1974, FEBS Lett. 
5 47(1): 86-89, Silvanovich, M. P., & Hill, A. D., 1976, Anal. Biochem. 73: 430-433, 
and Iefuji, H., et al., 1996, Biochem. J. 318: 989-996, was adapted especially for use 
in concentrated protein mixtures. Due to the high protein content, the reporter activity 
cannot be assayed using the original protocols. The present protocol differs from the 
original procedures in that 1) the non-specific proteins in the fluid are removed by 
10 addition of 40% ethanol, and 2) the ethanol-containing solution is applied directly onto 
the immobilized p-cyclodextrin to further reduce the nonspecific background. 

The present invention also includes a "plate diffusion" assay which is one 
method for measuring the reporter activity when large numbers of samples, for 

15 example, culture supernatants from a population of stably selected clones, must be 
assayed. The method described here is adapted from the observation that colonies-of 
amylase-producing microorganisms form clearing-zones on starch plates (Iefuji, II.. 
et al. 1994. Biosci. Biotechnol. Biochem. 58: 2261-2262). In the amylase reporter 
system, the amylase-containing solutions are applied to wells formed in starch- 

20 containing agarose plates; diffusion of the solution through the starch-agarose forms 
clearing zones whose size correlates with the concentration of amylase in the solution. 

Under conditions where cells are transiently transfected with the reporter 
plasmid, the amount of secreted reporter activity measured in the extracellular medium 

25 of a population of cells, transformed with a multicopy plasmid carrying a reporter gene 
which is driven by a strong, systemic promoter, is determined by two variables: 1 ) the 
number of cells that have received the reporter construct and; 2) the average number 
of copies of the DNA construct that each transformed cell received. Thus, under these 
conditions, the secreted reporter activity provides a measure of the efficiency of the 

30 DNA delivery system, i.e. how many cells in the population were transformed with 
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foreign DNA and how many copies, on average, of the transforming vector were 
incorporated into each transformed cell. 

Under conditions where the cells are stably transfected with the reporter 
plasmid, the level of secreted reporter activity would, in addition, depend on: 1) the 
site at which the reporter gene becomes integrated, 2) the metabolic state of the cells, 
and 3) the environmental conditions that may regulate the cell's metabolism in general, 
and the promoter that drives the reporter gene in particular. If two of the above 
conditions are experimentally controlled, the secreted reporter gene can be used to 
monitor the effects of the third variable in time-course-type experiments. 

The present invention also includes amylase reporter kits. Kits containing a 
single or double amylase gene are included within the scope of the present invention. 

The single reporter kit would include: two DNA constructs; a plate assay 
means; and instructions for use. The single reporter kit would optionally include a 
beta-dextrin purification system, which would be needed in high-protein fluids to allow 
detection of the amylase activity. 

The first DNA construct includes a test plasmid containing a single gene 
coding for a single amylase isozyme, with a multiple cloning region just upstream 
where a test promoter can be inserted. The second DNA construct includes a control 
plasmid where the gene is placed under control of a strong promoter. The plate assay 
detects amylase activity very simply. It consists of : a mould to form the gel plate; a 
buffer used to make the agarose media; detection solutions; and a sample of amylase 
as a positive control. The beta-dextrin system, if included, removes non-specific 
proteins from high-protein fluids to allow detection of the amylase activity. It consists 
of: an immobilized beta-dextrin packed into columns; equilibration buffer; wash 
buffer; and elution buffer. The buffers could be included within the kit or be 
substituted with instructions on the buffer preparation. The kit will also include a 
protocol booklet, which would give detailed procedures, as well as simple steps used 
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to remove background signals from high-protein biological fluids such as serum- 
containing media, pre-lactation secretion, and milk. 



The dual amylase report kit will include: two DNA constructs; a gel 
electrophoresis assay means; and instructions for use. The dual reporter kit would 
optionally include a beta-dextrin purification system, which would be needed in high- 
protein fluids to allow detection of the amylase activity. 

In this system the first DNA construct includes a test plasmid (containing the 
genes coding for both amylase isozymes, one of which is under a "normalizing" 
promoter). The second DNA construct includes a control plasmid containing the 
isozymes under different promoters. Gel electrophoresis allows unambiguous 
identification of the 2 isozymes. The assay consists of: a sample buffer; gel 
preparation buffer; detection solutions and amylase isozymes as positive controls. As 
with the single amylase reporter kit the buffers can be included within the kit or they 
can be replaced with instructions on the buffer preparation. 

If included, the beta-dextrin system is identical to that contained in the Single 
Reporter Kit and the protocol booklet is identical to that contained in the Single 
Reporter Kit. 

The kit of the present invention is much more inexpensive than any expression 
kit presently available. Furthermore, amylase is a very stable enzyme, whose signal 
improves with duration of expression. The dual expression kit allows for 
normalization of the expression data. Endogenous levels, as mentioned previously are 
not a problem with this detection system. 

While this invention is described in detail with particular reference to preferred 
embodiments thereof, said embodiments are offered to illustrate but do not limit the 
invention. 
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EXAMPLES 

EXAMPLE 1: Chicken a-amylase cDNA 

In order to isolate a chicken genomic a-amylase clone, a commercial chicken 
library in the vector X FIX II (Stratagene) was screened with a murine amylase cDNA 
probe (1.2 kbp PstI fragment of pMPa21, Hagenbuechle et al. 9 1980, Cell 21: 179- 
187). Hybridization was carried out under conditions of reduced stringency at 37°C 
in a standard buffer system containing 50% formamide (Benkel and Gavora, 1993, 
Animal Genetics 24: 409-413). Following hybridization, the filters were washed twice 
for 15 min each in 2x SSC, 0.1% SDS at 42°C, followed by a single wash in 0.5x 
SSC, 0.1% SDS at 50°C for 30 min. Autoradiograms were exposed at -70 C with 
intensifying screens. 

Eighteen positive signals were detected following the first round of library 
screening. Ten plaques were chosen at random for second screening, and 4 of these 
isolates were still positive after a third round of screening. One clone (A.A-1) was 
chosen for large-scale DNA isolation and fragment subcloning in preparation for 
sequence analysis. 

A series of overlapping subclones spanning the amylase genomic region was 
constructed by inserting restriction fragments of the primary isolate A.A-1 into the 
vector pUC18. Double-stranded DNA was sequenced by the gene-walking method 
using synthetic oligonucleotide primers. Primers were synthesized on an Applied 
Biosystems 392 synthesizer and deblocked and desalted before use. Sequencing 
reactions were performed using the dye terminator cycle sequencing kit as described 
in the instructions supplied by the manufacturer (Applied Biosystems Inc.)- The 
extension products were analyzed on an Applied Biosystems 373A automated 
sequencer, and sequence assembly was performed using MicroGenie software by 
Beckman. 

The chicken amylase coding region was prepared using reverse transcription- 
poly merase chain reaction technology (RT-PCR). Oligonucleotide primers for the PGR 
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step were designed based on the chicken genomic amylase sequence derived as 
described above. Approximately 2 fig of total RNA from chicken pancreas was used 
as substrate in a reverse transcription (RT) reaction using the Perkin Elmer RT-PCR 
kit components according to the instructions supplied by the manufacturer. The RT- 
5 reaction was primed with oligo-dT. The oligonucleotide primers used for the PCR 
stage of the RT-PCR reaction were as follows: (i) Chamy-Nhe (5'- 
ATGCIAGCTCAGT ACAATCCC AAC ACTC AGGCT-3 1 , SEQID No: 3) which spans 
the position in the coding region corresponding to the N-terminus of the mature 
enzyme; and (ii) Chanry-Hin (5 ! -CGAAGCTTATAACTTGGCATC A 

10 ACGTGAATTG-3 ' , SEQID No: 4) which spans the stop codon of the amylase coding 
region. The primers were designed to amplify the region in the gene encoding the 
mature a-amylase peptide. In addition, Chamy-Nhe converts the environment of the 
signal peptidase cleavage site into an Nhel site, while Chamy-Hin adds a Hindlll site 
immediately downstream of the stop codon (enzyme recognition sites are underlined). 

15 The modifications introduced by the PCR primers allow the amplified cDNA to be 
inserted into the reporter construct using cohesive overhang ligation. 

PCR amplification of the chicken amylase cDNA was performed using the LA- 
PCR kit as described in the manufacturer's product bulletin (TaKaRa). The amplified 
20 fragment was sequenced using PCR-based chain termination technology, prior to 
insertion into the reporter construct - this sequence, SEQ ID No: 1, is shown in Figure 
1. 



EXAMPLE 2: Signal Peptide Coding Region 

The signal peptide used in the current reporter gene is modelled on the sequence 
found in the Amy\ gene of the Oregon-R strain of Drosophila melanogaster (Boer and 
Hickey, 1986, Nuc. Acids Res. 14: 8399-8411). The sequence of the native signal 
peptide extends from nucleotide 1 to 54 in Figure 2 of that reference. The 5'-end of the 
sequence encoding the signal peptide was modified to incorporate an Ncol site straddling 
the translation start codon (ATG) - i.e. the sequence AT ATG T was changed to CC ATG 
G. This modification changes the amino acid immediately downstream of the initiator 
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Met from a Phe to a Val. In addition, the 3'-end of the signal sequence was modified to 
accommodate an Nhel site. Here, the original D. melanogaster sequence of - 
Ala-Asn-Ala- (GCC AAC GCC) was changed to --Ala-Leu-Ala- (GCG CTA GCC). 
The DNA fragment encoding the signal peptide was constructed in the laboratory on a 
DNA synthesizer, and assembled into the reporter construct using the restriction enzyme 
sites built into its ends. The novel signal peptide sequence used in the reporter construct 
has the following sequence (SEQID No: 5). 

5 " -CCATGGTTCTGGCCAAGAGCATAGTGTGCCTCGCCCTCCTGGCGGTGGCGCTAGCT-3 1 
3 • - GGTACCAAGACCGGTTCTCGTATCACACGGAGCGGGAGGACCGCCACCGCGATCGA- 3 1 



EXAMPLE 3: Promoter Region 

In this example the immediate early promoter of the Human cytomegalovirus 
(CMV; Stinski and Roehr, 1985, J. Virology 55: 431-441) was used to drive the 
expression of the reporter gene. 

Other examples of promoters that have been used successfully in the reporter 
gene include; a murine retroviral LTR, which is a composite of the MMTV (Mouse 
Mammary Tumor Virus; Ponta et aL 1985, Proc. Natl. Acad. Sci. 82: 1020-1024) and 
the MoMSV (Moloney Murine Sarcoma Virus; Lin et al. 1 990, Proc. Natl. Acad. Sci. 87: 
36-40) LTRs, and the rat beta-actin promoter (Nudel et al. 1983, Nucl. Acids Res., 1 1 , 
1759-1771). 

DNA fragments to be used as promoters were amplified by PCR using 
oligonucleotide primers that incorporated useful restriction enzyme sites at the upstream 
(Not!) and downstream (Ncol) ends of the amplified promoter sequences. Restricted 
promoters fragments were inserted into the reporter gene construct by cohesive end 
ligation. 
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EXAMPLE 4: Transcription Termination Region 

The final component of the reporter gene is the transcription terminator. In this 
example 567 bp of the Drosophila melanogaster Oregon-R amylase gene termination 
region was used. The sequence of this section (SEQID No: 2) is shown in Figure 4. The 
5 DNA fragment was amplified by PCR using primers that incorporated useful restriction 
enzyme sites. A HindlEL site was added just upstream of the TAA (stop codon) of the 
amylase coding region. This TAA is out of frame with the chicken amylase coding 
region, which brings in its own stop codon. The downstream end of the terminator is 
modified into an A scl site. 

10 

EXAMPLE 5: Drosophila melanogaster a-amylase cDNA 

Plaque lifts of a lambda gtlO library containing cDNAs from RNA of a 
Drosophila melanogaster Oregon-R strain were hybridized to a mouse salivary a- 
amylase cDNA probe (the 1650 bp PstI restriction fragment of pMSal04; 
15 Hagenbuechle el al. , 1980, Cell 21 : 179-187), under conditions of reduced stringency - 
38°C in a hybridization buffer containing 50% formamide. Following hybridization, 
filters were washed in 1 X SSC, 0.1% SDS at 52°C. 

Eight postive plaques were identified from an estimated 20,000 plaques screened. 

20 The longest of these cDNA inserts (OR-M7) was subcloned into a plasmid vector 
(pUC13) and subjected to DNA sequencing by standard methods (Benlcel et al., Genome 
29: 510-51 5). The DNA sequence was determined as described in Boer & Hickey, 1 986, 
NucL Acids Res. 14: 8399-841 1). The extension products were analyzed on an Applied 
Biosystems 373A automated sequencer, and sequence assembly was performed using 

25 MicroGenie software by Beckman. OR-M7 was found to contain the complete coding 
region of the Drosophila melanogaster a-amylase pre-enzyme. 

The DNA fragment encoding the mature Drosophila melanogaster a-amylase 
enzyme was inserted into expression vectors by PCR amplification using the LA-PCR . 
30 kit and synthetic oligonucleotide primers. The oligonucleotide primers used for the 
PCR amplification were as follows: (i) Dm-Ql (5 1 - ATGCIAGCGC A ATTCGAC AC 
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CAACTACGCA-3', SEQID No: 6) which spans the position in the coding region 
corresponding to the N-terminus of the mature enzyme; and Dm3-2 (5'- 
C A AGCTT GTTT AC AACTTGGCGTTG ACGT-3 ' , SEQID No. 7) which spans the 
stop codon of the amylase coding region. The primers were designed to amplify the 

5 region in the gene encoding the mature cc-amylase peptide. In addition, Dm-Ql 
converts the environment of the signal peptidase cleavage site into an Nhel site, while 
Dm3-2 adds a HindTO. site immediately downstream of the stop codon (enzyme 
recognition sites are underlined). The modifications introduced by the PCR primers 
allow the amplified cDNA to be inserted into the reporter construct using cohesive 

10 overhang ligation. The modified Drosophila a-amylase cDNA is SEQID No. 8. 

EXAMPLE 6: The oc-amylase dual-reporter gene system 

Construction of the "test" and "control" plasmids began with the plasmid pIBI- 
B, which was formed from the vector pIBI25 (International Biotechnologies Inc.) by 
15 techniques of PCR amplification using primers with redundant overhangs (Scharf, 
1990), and standard oligonucleotide linker addition (Sambrook et al., 1989). pIBI-B 
consists of: (i) Drosophila a-amylase promoter; (ii) the Drosophila a-amylase signal 
sequence, as described in Example 2; and (iii) the Drosophila a-amylase termination 
region, as described in Example 4. 

20 

The region spanning the chicken a-amylase cDNA was amplified by long-range 
PCR (LA-PCR kit, TaKaRa), incorporating a Nhel site at the 5' primer and a HinaUl 
site at the 3' primer. This PCR product was ligated into the plasmid pffil-B via the 
Nhel and Hindm sites to form the plasmid pIBI-fAmy, which contains chicken a- 

25 amylase joined to Drosophila amylase signal sequence. The CMV promoter was 
amplified by PCR from the plasmid pLNCX (Miller, D.M., & Rosman, G.J., 1989, 
BioTechniques, 7(9): 980-990), then inserted into the plasmid pIBI-fAmy, just 
upstream of the 5' end of the Drosophila signal sequence, replacing the Drosophila 
a-amylase promoter and forming the plasmid pIBI-CfAmy. The entire region 

30 spanning the CMV promoter to the 3' end of the chicken a-amylase gene was amplified 
by long-range PCR, incorporating unique sites at the 3' and 5' ends. This PCR 
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product was ligated to the 31 lObp HindHL-Fsel fragment of the plasmid pSEAP2-Basic 
(Clontech), forming the plasmid pUC-CFAmyHF. The Drosophila a-amylase gene, 
including the signal sequence, was amplified by PGR from the cDNA clone; the PCR 
product was inserted upstream of the chicken a-amylase cassette in the plasmid pUC- 
CFAmyHF to form the final plasmid pUC-TEST (SEQID No: 9). This "test" plasmid, 
shown in Figure 10, has the chicken a-amylase gene driven by the CMV promoter. 
The Drosophila a-amylase gene is not under any promoter; instead, there is a 
poly linker sequence 5' of the Drosophila a-amylase gene that allows promoters of 
interest to be inserted. In SEQID No:9 the components are located in the following 
positions: Drosophila a-amylase signal sequence: 83-134; Drosophila a-amylase: 
135-1575; Drosophila a-amylase terminator: 1588-2128; CMV promoter: 2129-2951; 
Drosophila a-amylase signal sequence: 2952-2999; chicken amylase: 3000-4505; SV40 
late polyadenylation site: 4506-4754; pUC origin of replication: 5033-5676; 
p-lactamase gene: 6684-582; p-lactamase promoter region: 6754-6685; Fl origin: 
6816-7272; synthetic polyadenylation site: 7462-7450; and pause site from humart 
a2-globin gene: 7464-7555. 

r 

To construct the plasmid pUC-CONTROL, the rat p-actin promoter was 
inserted between the transcription block and the 5' end of the Drosophila a-amylase 
gene in the plasmid pUC-TEST. This plasmid (SEQID No: 10), shown schematically 
in Figure 10, has the chicken a-amylase gene driven by the CMV promoter, and the 
Drosophila a-amylase gene driven by the p-actin promoter. In SEQID No: 10 the 
components are located in the following positions: pUG origin of replication: 
5384-6027; p-lactamase gene: 7035-6175; p-lactamase promoter: 7105-7036; Fl 
origin: 7167-7623; synthetic polyadenylation site: 7753-7802; and pause site (human 
a2-globin): 7815-7906. 

EXAMPLE 7: The Chicken a-amylase reporter gene system 

A variety of methods are available for the delivery of the reporter gene to the 
target host cells including: 1) liposome-mediated fusion; 2) micro-injection; 3) particle 
bombardment; 4) viral vectors; and 5) calcium phosphate co-precipitation. 
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For the purpose of this non-limiting illustrative example, the reporter gene 
linked to the CMV promoter has been inserted into the plasmid pIBI25 (International 
Biotechnologies Incorporated) to produce a transient expression construct. In addition, 
the reporter constructhas been delivered to the cells using the calcium phosphate co- 
precipitation method. Other methods of DNA delivery are applicable, as is the use of 
vectors that promote the stable maintenance of the reporter gene within the transformed 
cells, e.g. retroviral vectors. 

EXAMPLE 8: Harvesting of the Reporter Activity from Serum-Free Culture 
Supernatants: 

Cells were grown under standard mammalian cell culture conditions. The 
media used to culture the cell lines were as follows: (i) for MA104 and Hela, 
Dulbecco's modified Eagle's medium (DMEM) with 10% Fetal Bovine Serum; (ii) for 
CHO, DMEM with 10% Newborn Calf Serum; (iii) for PA317 and PG13, DMEM 
with iron-enriched 10% Calf Serum. Cells were plated at a density of 5x10 s cells per 
10 cm dish. Calcium phosphate transfection was carried out overnight. Following 
transfection, cells were incubated overnight in cell line-specific media as described 
above. On day 4, the culture medium was removed and replaced with Serum-free 
DMEM for all cell lines. On day 5, samples of medium were collected, and debris 
was removed from the samples by centrifugation at 500xg. 

Cell supernatant was adjusted to 40% (vol/vol) of ethanol and the mixture 
incubated on ice for 1 hour. The solution was centrifuged at 16,000xg for 20 minutes 
at 4°C and the supernatant dried under vacuum to remove the ethanol. 100-150p,L of 
the resulting solution was used for routine analysis by native gel electrophoresis. To 
concentrate the solution, - lmg of carrier protein (e.g. BSA) was added to the sample 
and the solution applied to a Centricon-30 filtration columns (Amicon). As little as 
0.0005U of amylase processed in this manner can be detected by native gel 
electrophoresis. 
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EXAMPLE 9: Harvesting of the Reporter Activity from Serum-Containing 

Culture Supernatants: 

Cells were grown under standard mammalian cell culture conditions as 
described above, except that 10% serum was included in the culture media. The 
procedure used to harvest the reporter protein is the same as that described for 
harvesting from milk (see below). The presence of serum gives some non-specific 
activity; however, this activity is clearly distinguishable from the reporter activity 
when detected by native gel electrophoresis (described in Example 12) because it has 
a different mobility. For an absolutely clean background and for concentrating the 
sample, adsorption on P-cyclodextrin (described in Example 11) can be used. 

EXAMPLE 10: Harvesting of the Reporter Activity from Milk: 

Two alternative methods are used to recover the a-amylase activity from milk. 
In the first method, fresh milk containing a-amylase was de-fatted by centriftigation at 
80Q0xg, 10 min. at room temperature. The aqueous fraction was acid-curdled with 1M 
HC1 to a final pH of 4.5, then centrifuged at 8000xg for 20 min. The whey 
(supernatant) fraction was neutralized with 1M NaOH to a final pH of 6.7. A portion 
of the final solution was used for analysis. 

In the second method, the de-fatted milk was adjusted to 40% ethanol, placed 
on ice for l-2h, then centrifuged at 16000xg at 4°C. The supernatant was then placed 
under vacuum to remove the ethanol, and a portion of the final solution was used for 
analysis with essentially complete recovery of the reporter activity. For concentrating 
the sample, adsorption on P-cyclodextrin without removing the ethanol (described in 
Example 1 1) can be used. 

EXAMPLE 11: Harvesting of the Reporter Activity from Pre-laction 
Secretion (Pre-milk): 

Bovine pre-milk is a jelly-like substance that is so viscous it is nearly solid. To 
analyze a-amylase activity in the pre-milk, the sample was diluted 1/3-1/4 with water, 
then adjusted to 40% ethanol. The sample was placed on ice for l-2h, then centrifuged 
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at 16000xg at 4°C to remove the precipitated proteins. p-cyclodextrin-Sepharose 6B, 
prepared as described in Vretblad, P., 1974, FEBS Lett. 47(1): 86-89, was added to 
the sample at a ratio of 40^L packed beads to 4mL of prepared sample. The sample 
was shaken for 2h at room temperature, then washed 5 times in 1.5mL of wash buffer 
(50mM Tris-HCl, ImM CaCl 2 pH7.5). The sample was eluted by incubation for 5 min 
at room temperature in a desired volume of lOmg/mL p-cyclodextrin. As little as 
0.0002U of a-amylase activity per mL of pre-milk can be recovered this way. 

EXAMPLE 12: Detection of Reporter Activity 

Amylase activity can be detected and measured by a variety of simple, safe 
procedures. These can be divided into the tube or liquid assays and the assays that 
involve diffusion in a semi-solid medium or electrophoretic separation. 

The preferred protocol is electrophoretic separation of the proteins in the 
sample on a native (non-denaturing) polyacrylamide gel ("native gel electrophoresis"). 
This protocol is described in Benkel and Hickey (1986, Genetics 114: 137-144). 
Briefly, a low percentage acrylamide gel allows proteins to be separated on the basis 
of the overall charge of the molecules. Thus, even though chicken and mammalian 
amylases are very similar in molecular weight, they display very different migration 
patterns in the native gel assay (see Figure 5). 

Following electrophoretic separation, the gel is incubated in a buffer solution 
containing partially-hydrolyzed starch. The starch granules coat the gel and penetrate 
the gel surface. Staining of the starch-coated gel with iodine results in a gel that shows 
clear amylase bands on a dark blue background. 

The main advantage of the gel assay is that the background activity measured 
for the recipient cells is essentially zero. The gel assay is highly sensitive, and can 
easily be converted into a quantitative format by the incorporation of serial dilutions 
of an activity standard (see Benkel and Hickey, 1986, Genetics 114: 943-954). In 
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addition, there is a dye-linked starch-based substrate that facilitates real-time activity 
visualization. 

As an alternative to native gel electrophoresis, prepared samples containing a- 
amylase can also be assayed by a very simple "plate diffusion" method. 1% agar, 
made in 50mM Tris-HCl, ImM CaCl 2 pH 7.5, containing 0.02% starch, was poured 
onto culture plates. 4mm-diameter holes were made in the agar, into which up to 30/xL 
of sample can be placed. The plates were covered and incubated overnight at 37 °C to 
allow the amylase to diffuse. "Haloes", formed as the starch becomes digested, were 
detected by staining with iodine as described in the native gel electrophoresis 
procedure. 

Chicken amylase in cell culture medium and in milk was tested and the activity 
was found to be stable for days at room temperature in a bioactive format. In addition- 
freezing and thawing does not affect the activity of the enzyme. 

Figure 6 shows a variety of mammalian cell lines transformed with the reporter 
construct containing the CMV promoter delivered using a transient-expression vector. 
All cell lines tested including; CHO (hamster), ET-2 (cow), Hela (human), MA 104 
(monkey), PA317 and PG13 (mouse) showed production and secretion of chicken 
amylase according to the native gel assay. Different cell lines appear to secrete 
chicken amylase at different efficiencies. This may reflect the ability of the cells to 
take up foreign DNA, or it may be a measure of the inherent capacity of the different 
cell types to secrete proteins. 

Figure 7 shows the results of an experiment in which PA317 cells were co- 
transfected with two reporter constructs: one containing chicken a-amylase as a 
reporter; the other contains firefly luciferase as a well-known intracellular reporter. 
In each co-transfection, both reporters are driven by the same promoter, which is the 
murine retroviral LTR, the rat p-actin promoter, or the human CMV early immediate 
promoter (see EXAMPLE 3 above for details of the promoters). The supernatants of 
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transiently transfected cells were harvested and analyzed for reporter activity as 
described in Example 8, while cell pellets were processed and analyzed for luciferase 
activity according to the instructions supplied by the luciferase assay kit manufacturer 
(Promega Corp). The results for extracellular amylase reporter activities show that the 
expression level of the CMV promoter is roughly four times that of the [5-actin 
promoter, which in turn is about twice that of the LTR. Similar results were obtained 
with the intracellular luciferase promoter (Figure 12). 

One of the primary applications for secreted reporter genes is in the 
optimization of DNA delivery and transformation in vivo of mammalian cells that 
secrete proteins in to biological fluids. This optimization process is crucial to the 
development of successful approaches for cell transformation in gene therapy. We are 
currently exploring the use of somatic transgenesis of bovine epithelial tissue in the 
development of 'bioreaetor' cows for the production of pharmaceutical agents into 
milk. In order to compare the suitabilities of SEAP (Clontech Cat. No. K2G41-1) and 
a-amylase as reporter gene/enzyme systems for the transformation of cow udder cells, 
SEAP and a-amylase were spiked into fresh cows' milk their activities measured. The 
SEAP used to spike the milk came from cell culture supernatant; the a-amylase was 
prepared from chicken pancreas. 

Fresh cows' milk was centrifuged at 8,000 xg for 10 minutes at room 
temperature to remove the fat. SEAP cell supernatant and chicken a-amylase was 
added to the aqueous fraction. For the SEAP reconstitution experiment, SEAP activity 
was measured for the following treatments: (i) untreated milk aqueous phase 
("Endogenous" in Table 1); (ii) milk aqueous phase heated to 65 °C for 30 minutes 
("Heated"); (iii) milk aqueous phase with SEAP spike ("Spiked"); and (iv) milk 
aqueous phase with SEAP spike heated to 65 °C for 30 minutes ("Spiked/heated"). 

The results demonstrated that there is a high background of endogenous, soluble 
alkaline phosphatase activity in cows' milk. This activity is indistinguishable from 
SEAP. Heating the samples to 65 °C does not decrease the endogenous SEAP activity 
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level; instead it appears that a portion of the endogenous alkaline phosphatase activity 
is present in an inactive, protein-complexed state in milk, and that heating tends to 
promote the release of some of this protein-bound endogenous SEAP activity. This 
results in a further increase in the background SEAP levels measured in the unspiked 
5 milk. The ability of milk proteins to inactivate SEAP is also evident from the dramatic 
decrease in SEAP signal measured in the spiked samples, where only about 1/4 of the 
SEAP activity added in the spike is detectable following the addition of the milk. 
Heating the spiked samples to 65 °C raises the levels somewhat on average, but this 
increase in activity is at least partially due to the release of endogenous alkaline 
10 phosphatase enzyme. 



TABLE 1: Measurement of SEAP activity in cow's milk. 



Sample 


Endogenous 


Heated 


Spiked 


Spiked/heated 


Cow 1 


52 


60 


282 


307 


Cow 2 


59 


62 


198 


190 



Legend: Cows' milk was processed as described in the text. SEAP activities are given 
in lOOO's of luminometer units. "Endogenous" refers to the endogenous background 
activity present in the aqueous phase of the milk; "Heated" refers to the background 
20 activity in the milk following a 30 minute incubation at 65 °C; "Spiked" refers to the 
unheated milk following the addition of the SEAP spike; "Spiked/heated" represents 
the spiked sample following a 30 minute incubation at 65 °C. The SEAP spike consisted 
of 1,075 (xlOOO) units. 

25 Increasing the level of the alkaline phosphatase-inhibitor homoarginine in the 

samples had no effect on the SEAP activity measurements. On the other hand, acid 
curdling of the milk to remove caseins, a standard method used to fractionate milk, 
resulted in the loss of the bulk of the SEAP spike. Likewise, adjusting the milk to 
40% ethanol eliminates the SEAP spike, which precipitated out with the non-specific 

30 proteins. 



Figure 8 shows milk samples with and without the addition of chicken a- 
amylase preparations. For whole milk, the gel assay shows a light smear throughout 
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the entire lane (see lane 1). This is due to the effect of the abundant milk casein 
proteins. However, the band of activity corresponding to the chicken amylase reporter 
'spike' is clearly visible even in whole milk (lane 2). Our standard protocol for the 
amylase reconstitution experiments included an acid curdling step to remove caseins 
from the whole milk. This treatment was incompatible with the SEAP reporter system 
because it almost entirely depleted the milk aqueous phase of alkaline phosphatase 
activity (see above). 

Following neutralization, the whey containing the chicken a-amylase spike was 
analyzed by the standard gel assay. This treatment completely eliminated any 
endogenous background activity from the milk samples, and left the chicken amylase 
band clearly visible in the spiked samples (lanes 7&8). Thus, chicken a-amylase is 
superior to SEAP as a secreted reporter enzyme in biological fluids such as cow's 
milk. 

As outlined above, a major advantage of the amylase reporter system over 
existing reporter assays is the ability to detect chicken amylase activity in complex 
biological fluids. Cow's pre-lactation secretions (also known as pre-milk) are 
particularly problematic biological samples due to their extremely high content of 
proteins, fats and carbohydrates. However, using the simple p-cylodextrin adsorption 
procedure described in Example 11, one can recover the chicken a-amylase reporter 
activity amylase quantitatively even from such recalcitrant materials such as early and 
late pre-lactation secretions of pregnant heifers (see Figure 9). Note the complete lack of 
background activity for chicken amylase in the pre-milk of untreated udder quarters 
(lanes 1 & 6), and the presence of reporter activity only in udder quarters that were trans- 
infected with constructs encoding chicken a-amylase (lanes 2, 3, 4 & 5). In this way, 
as little as 0.0002U of a-amylase activity per mL of pre-milk can be detected. 

Figure 10 gives a schematic representation of the dual reporter constructs used 
for promoter strength determinations in transiently transfected, culture cells. To 
measure the strength of a "test" promoter, the pUC-TEST plasmid is used. It consists 
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of two expression cassettes. The Drosophila a-amylase cassette has the Drosophila a- 
amylase cDNA, including the signal sequence and the terminator region. There is no 
promoter driving this reporter gene; instead, a polylinker region is provided 
immediately 3 ' of the cDNA to facilitate insertion of a promoter sequence of interest. 
5 Just upstream of this polylinker region is a partially synthetic "transcriptional blocker" 
designed to minimize non-specific transcription of a-amylase. The Drosophila a- 
amylase gene is, therefore, the "test" isozyme. The second cassette has the chicken 
a-amylase cDNA joined at the 5' end with the signal sequence from Drosophila 
melanogaster; the terminator region for this cassette comes from the late 
10 polyadenylation site of SV40, to minimize possible recombination with the Drosophila 
a-amylase cassette. The chicken a-amylase gene is driven by the CMV promoter, and 
therefore, acts as the "normalizing" isozyme. The backbone of the plasmid pUC- 
based. 

15 To provide a control showing that both cassettes are functional, the plasmid 

pUC-CONTROL is used. This plasmid is identical to pUC-TEST, except that the rat 
P-actin promoter was inserted upstream of the Drosophila a-amylase. 

Figure 11 shows the results of transfecting PA317 cells in culture with the dual 
20 reporter construct. Lane 3 shows the lack of background activity in the culture medium 
of untransfected PA317 cells. Lane 4 contains culture medium from PA317 cells 
transfected with the plasmid pUC-TEST. The activity band is specific to the 
normalizing activity resulting from the expression of the chicken amylase reporter gene 
under the control of the CMV promoter. Lane 5 contains culture medium from PA317 
25 cells transfected with the vector pUC-CONTROL. The chicken amylase band results 
from the expression of the normalizing isozyme while the Drosophila melanogaster 
(fly) amylase activity band results from the expression of the fly gene under the control 
of the p-actin promoter. The relative signal intensities of the two activity bands within 
the same gel provide a direct comparison of the transcriptional activities of the CMV 
30 and p-actin promoters. The P-actin promoter was inserted into pUC-CONTROL for 
the purposes of this example only. It will be obvious to persons skilled in the art that 
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the pUC-TEST assay system can be used to assay any nucleotide sequence whether 
natural or synthetic for promoter activity. Furthermore, by employing host cells other 
than PA317 cell line, the assay can be used to evaluate the effect of host cell type on 
promoter activity. In this particular instance, 80/xL of culture supernatant, processed 
as described in Example 8, was used. 

All scientific publications and patent documents are incorporated herein by 
reference. 

The present invention has been described with regard to preferred embodiments. 
However, it will be obvious to persons skilled in the art that a number of variations and 
modifications can be made without departing from the scope of the invention as 
described in the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Her Majesty in Right of Canada as Rep. by 

Agriculture and Agri-Food Canada 

(B) STREET: Central Experimental Farm 

(C) CITY: Ottawa 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) POSTAL CODE (ZIP) : K1A 0C6 

(A) NAME: The University of Ottawa 

(B) STREET: 115 Seraphin Marion 

(C) CITY: Ottawa 

(D) STATE: Ontario 

( E ) COUNTRY : Canada 

(F) POSTAL CODE (ZIP) : KIN 6N5 

(ii) TITLE OF INVENTION: Secreted alpha-Amylase as a Reporter Gene 
(iii) NUMBER OF SEQUENCES: 10 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: CA 0,000,000 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: CM 2,203,613 

(B) FILING DATE: 24 -APR- 1997 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GCTAGCTCAG TACAATCCCA ACACTCAGGC TGGGAGGACA TCTATCGTGC ATCTCTTTGA 
ATGGCG CTGG GCCGACATTG CACTGGAGTG CGAACACTAT TTAGCTCCTA ATGGGTTTGG 



60 



120 
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AGGAGTTCAG GTTTCTCCTC CAAATGAAAA CATTGTCATT ACTAATCCGA ACAGGCCCTG 180 

GTGGGAAAGA TACCAGCCCA TCAGCTACAA GATCTGCAGT CGATCGGGCA ATGAAAATGA 240 

ATTCAGAGAC ATGGTGACCA GATGCAACAA TGTTGGAGTT CGTATTTATG TGGATGCTGT 300 

TGTCAATCAC ATGTGTGGAT CTATGGGTGG CACGGGCACC CACTCAACAT GTGGGAGCTA 36 0 

TTTCAACACC GGGACTAGAG ATTTTCCCGC TGTGCCGTAC TCTGCCTGGG ATTTCAATGA 420 

CGGCAAATGT CACACTGCAA GTGGAGACAT CGAAAATTAT GGGGACATGT ATCAGGTCCG 48 0 

GGATTG CAAG TTGTCCAGCC TTCTTGATCT GGCTCTGGAG AAGGACTATG TACGCTCAAC 540 

AATTGCAGCG TACATGAATC ACCTCATTGA TATGGGTGTA GCAGGGTTCC GGATCGATGC 60 0 

TGC CAAG CAT ATGTGGCCAG GGGACATAAG AGCGTTTCTG G AC AAACTG C ACGATCTAAA 66 0 

TACTCAGTGG TTTTCAGCAG GAACGAAACC CTTTATTTAC CAAGAGGTAA TTGACTTGGG 72 0 

AGGAGAGCCA ATCACAGGCA GTCAGTACTT TGGGAATGGC CGCGTGACAG AATTCAAGTA 780 

TGGTGCCAAA CTGGGGACGG TGATCCGGAA GTGGAATGGA GAGAAGATGG CCTACTTAAA 84 0 

GAACTGGGGA GAAGGCTGGG GCTTTGTGCC TTCTGACAGA GCCCTGGTGT TTGTGGATAA 900 

CCACGACAAC CAGCGGGGGC ACGGGGCAGG CGGAGCTTCC ATTCTTACTT TCTGGGATGC 96 0 

CAGGCTTTAT AAAATGGCGG TTGGTTTCAT GCTCGCTCAT CCGTACGGGT TCACACGGGT 102 0 

GATGTCAAGT TATCGTTGGC CAAGATATTT CGAAAACGGA GTGGATGTTA ACGACTGGGT 108 0 

GGGACCACCA AGTAACTCGG ACGGATCGAC GAAGTCCGTT ACAATCAACG CAGACACTAC 114 0 

CTGTGGCAAT GACTGGGTCT GCGAACATCG CTGGCGACAA ATAAGGAACA TGGTTATCTT 12 0 0 

CCGTAATGTG GTAGACGGTC AGCCTTTCTC AAACTGGTGG GACAACGGGA GCAATCAAGT 1260 

AGCTTTCGGT CGCGGCGACA GAGGCTTCAT TGTCTTTAAT AATGATGACT GGTATATGAA 132 0 

TGTCGATTTG CAAACTGGTC TGCCTGCTGG AACCTACTGC GATGTTATTT CTGGACAAAA 13 8 0 

GGAAGGCAGT GCGTGTACTG GAAAGCAGGT GTACGTTTCC TCGGATGGAA AGGCCAATTT 144 0 

CCAGATTAGT AACAGCGATG AAGATCCATT TGTTG CAATT CACGTTGATG CCAAGTTATA 1500 

AGCTT 1505 
(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 
AAGCTTGTAA ACAGCTGGGG AGCATGGCGA ACAGC CAGGC AATTAATTGA GATTATTAAT 6 0 

TGTACGAAAT ATATATGATG AGATTATAAA CACACAACAC TTTTATT CGC AAGGGATGAT 12 0 

AAGAATCTAA TATATATATT ATCTGGGCTT CAAAGCATTG ATTTTATTTA TTGAGTCAAG 180 

AGGGAAATTT ATTTTCTTGT TATTCTCTGT CCAGGTCTAA AGTCCCGAGC GGTGAGGCTA 24 0 

TCTATTGATT TGGACATTCC AATCGAATAC AAAACAGAGA TACAGAAATT TGCGAAAAAA 3 00 
TTTGATAACA ATCGTGGATT TTACGAATTA GACAAATTGA TATGTGCTTG CTAATTGATG ' 36 0 

TGGCATGAAA TAAGAAATTT ATAAGGACGT TTTCAAGTGC TTCTATTTTA AACATTCAGG 42 0 

ATTTTTTTTT AAAGCAGACA GCTTTCAACA GGTTTGATGA GAATTTGAAT ATTGATTGTT 4 80 

GACTTTAGCT ATACATAAAT CACACCTCAT CCACCCATTG TGGTATCCTT CGAAGGACTT 54 0 

GGGAACTGGA TCCTCTAGAA GGCGCGCC 56 8 
(2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGCTAGCTC AGTACAATCC CAACACTCAG GCT 3 3 

(2) INFORMATION FOR SEQ ID NO : 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
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CGAAGCTTAT AACTTGG CAT CAACGTGAAT TG 
(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCATGGTTCT GGCCAAGAGC ATAGTGTGCC TCGCCCTCCT GGCGGTGGCG CTAGCT 56 
(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 
ATGCTAGCGC AATTCGACAC CAACTACGCA 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CAAGCTTGTT TACAACTTGG CGTTGACGT 
(2) INFORMATION FOR SEQ ID NO : 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 9 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATGCTAGCGC AATTCGACAC CAACTACGCA TCCGGTCGTA GTGGAATGGT CCACCTCTTC 6 0 

GAGTGGAAGT GGGACGACAT CGCTGCCGAG TGCGAAAACT TCCTTGGACC CAATGGCTAC 12 0 

GCCGGTGTTC AGGTCTCCCC TGTGAACGAG AACGCCGTCA AGGAC AG CCG CCCCTGGTGG 180 
GAACGTTACC AGCCCATCTC CTACAAGCTG GAGACCCGCT CCGGAAACGA AGAGCAGTTC 24 0 

GCCAGCATGG TCAAGCGCTG CAACGCCGTC GGAGTGCGCA CCTACGTGGA CGTGGTCTTC 3 00 

AACCACATGG CCGCCGACGG AGGCACCTAC GGCACTGGCG GCAGCACCGC CAGCCCCAGC 3 60 

AGCAAGAGCT ATCCCGGAGT GCCCTACTCC TCGCTGGACT TCAACCCGAC CTGCGCCATC 42 0 

AGCAACTACA ACGACGCCAA CGAGGTGCGC AACTGCGAGC TGGTCGGTCT GCGCGACCTT 4*8 0 

AACCAGGGCA ACTCCTACGT GCAGGACAAG GTGGTCGAGT TCCTGGACCA TCTGATTGAT 54 0 

CTCGGCGTGG CCGGATTCCG "CGTGGACGCC GCCAAGCACA TGTGGCCCGC CGACCTGGCC 6 00 

GTCATCTATG GCCGCCTCAA G AAC CTAAAC ACCGACCACG GCTTCGCCTC GGGATCCAAG 66 0 

GCGTACATCG TCCAGGAGGT CATCGACATG GGCGGCGAGG CCATCAGCAA GTCCGAGTAC 72 0 

ACCGGACTGG GCGCCATCAC CGAGTTCCGC CACTCCGACT CCATCGGCAA GGTCTTCCGC 7 80 

GGCAAGGACC AGCTGCAGTA CCTGACCAAC TGGGGCACCG CCTGGGGCTT CGCTGCCTCC 84 0 

GACCGCTCCC TGGTATTCGT CGACAACCAC GACAATCAGC GCGGACATGG AG C AGGAGG C 90 0 

GCCGACGTTC TGACCTACAA GGTGCCCAAG CAGTACAAGA TGGCCTCCGC CTTCATGCTG 96 0 

GCGCACCCCT TCGGCACTCC CCGCGTGATG TCCTCCTTCT CCTTCACGGA CACCGATCAG 102 0 

GGCCCGCCCA CCACCGACGG CCACAACATC GCCTCGCCCA TCTTCAATAG CGACAACTCC 108 0 

TGCAGCGGCG GCTGGGTGTG TGAGCACCGC TGGCGCCAGA TCTACAACAT GGTGGCCTTC 114 0 

CGAAACACCG TGGGCTCGGA CGAGAT C C AG AACTGGTGGG ACAACGGCAG CAACCAGATC 12 00 

TCCTTCAGCC GAGGCAGCCG CGGCTTCGTG GCCTTCAACA ACGACAACTA CG AC C TG AAC 126 0 

AGCTCCCTGC AGACGGGCCT GCCCGCCGGC ACCTACTGCG ACGTCATCTC CGGCTCCAAG 13 2 0 

AGCGGTTCCT CCTGCACGGG CAAGACCGTC ACCGTCGGAT CCGACGGACG GGCTTCCATC 13 8 0 
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AACATTGGCA GCTCCGAGGA CGACGGAGTG CTGGCCATTC ACGTCAACGC CAAGTTGTAA 
ACAAGCTTG 

(2) INFORMATION FOR SEQ ID NO : 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7562 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 

GGTACCGAGC TCTTACGCGT GCTAGCCCGG GCTCGAGCGG CCGCCAGTGT GATGGATATC 6 0 

TGCAGAATTC GGCTTAGTCG ACCCATGGTT CTGGC CAAG A GCATAGTGTG CCTCGCCCTC 12 0 

CTGGCGGTGG CGCTAGCCCA ACGCCCAATT CGACACCAAC TACGCATCCG GTCGTAGTGG 18 0 

AATGGTCCAC CTCTTCGAGT GGAAGTGGGA CGACATCGCT GCCGAGTGCG AAAACTTCCT 24 0 

TGGACCCAAT GGCTACGCCG GTGTTCAGGT CTCCCCTGTG AACGAGAACG CCGTCAAGGA 3 00 

CAGCCGCCCC TGGTGGGAAC GTTACCAGCC CATCTCCTAC AAGCTGGAGA CCCGCTCCGG 360 

AAACGAAGAG CAGTTCGCCA GCATGGTCAA GCGCTGCAAC GCCGTCGGAG TGCGCACCTA 4 20 

CGTGGACGTG GTCTTCAACC ACATGGCCGC CGACGGAGGC ACCTACGGCA CTGGCGGCAG 48 0 

CACCGCCAGC CCCAGCAGCA AGAGCTATCC CGGAGTGCCC TACTCCTCGC TGGACTTCAA 54 0 

CCCGACCTGC GCCATCAGCA ACTACAACGA CGCCAACGAG GTGCGCAACT GCGAGCTGGT 6 00 

CGGTCTGCGC GACCTTAACC AGGGCAACTC CTACGTGCAG GACAAGGTGG TCGAGTTCCT 66 0 

GGACCATCTG ATTGATCTCG GCGTGGCCGG ATTCCGCGTG GACGCCGCCA AG CACATGTG 720 

GCCCGCCGAC CTGGCCGTCA TCTATGGCCG CCTCAAGAAC CTAAACACCG ACCACGGCTT 78 0 

CGCCTCGGGA TCCAAGGCGT ACATCGTCCA GGAGGTCATC GACATGGGCG GCGAGGCCAT 84 0 

CAGCAAGTCC GAGTACACCG GACTGGGCGC CATCACCGAG TTCCGCCACT CCGACTCCAT 90 0 

CGGCAAGGTC TTCCGCGGCA AGGAC CAGCT GCAGTACCTG ACCAACTGGG GCACCGCCTG 96 0 

GGGCTTCGCT GCCTCCGACC GCTCCCTGGT ATTCGTCGAC AACCACGACA ATCAGCGCGG 1020 

ACATGG AG C A GGAGGCGCCG ACGTTCTGAC CTACAAGGTG CCCAAGCAGT ACAAGATGGC 108 0 

CTCCGCCTTC ATGCTGGCGC ACCCCTTCGG CACTCCCCGC GTGATGTCCT CCTTCTCCTT 1140 
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CACGGACACC GATCAGGGCC CGCCCACCAC CGACGGCCAC AACATCGCCT CGCCCATCTT 12 0 0 

CAATAGCGAC AACTCCTGCA GCGGCGGCTG GGTGTGTGAG CACCGCTGGC GCCAGATCTA 1260 

CAACATGGTG GCCTTCCGAA ACACCGTGGG CTCGGACGAG ATCCAGAACT GGTGGGACAA 13 2 0 

CGGCAGCAAC CAGATCTCCT TCAGCCGAGG CAGCCGCGGC TTCGTGGCCT TCAACAACGA 13 80 

CAACTACGAC CTGAACAGCT CCCTGCAGAC GGGCCTGCCC GCCGGCACCT ACTGCGACGT 144 0 

CATCTCCGGC TCCAAGAGCG GTTCCTCCTG CACGGGCAAG ACCGTCACCG TCGGATCCGA 15 00 

CGGACGGGCT TCCATCAACA TTGGCAGCTC CGAGGACGAC GGAGTGCTGG CCATTCACGT 156 0 

CAACGCCAAG TTGTAAGCTT GTAAACAGCT GGGGAGCATG GCGAACAGCC AGGCAATTAA 162 0 

TTGAGATTAT TAATTGTACG AAATATATAT GATGAGATTA TAAACACACA ACACTTTTAT 16 8 0 

TCGCAAGGGA TGATAAGAAT CTAATATATA TATTATCTGG GCTTCAAAGC ATTGATTTTA 17 4 0 

TTTATTGAGT CAAGAGGGAA ATTTATTTTC TTGTTATTCT CTGTCCAGGT CTAAAGTCCC 18 00 

GAGCGGTGAG GCTATCTATT GATTTGGACA TTCCAATCGA ATACAAAACA GAGATACAGA 186 0 

AATTTCGGAA AAAATTTGAT AACAATCGTG GATTTTAGCA ATTAGACAAA TTGATATGTG 19 20 

CTTGCTAATT GATGTGG CAT GAAATAAGAA ATTTATAAGG ACGTTTTCAA GTGCTTCTAT 19 80 

TTTAAACATT CAGGATTTTT TTTTAAAGCA GACAGCTTTC AACAGGTTTG ATCAGAATTT 2 040 

GAATATTGAT TGTTGACTTT AGCTATACAT AAATCACACC TCATCCACCC ATTGTGGTAT 2 1 0 0 

CCTTCGAAGG ACTTGGGAAC TGGATCCTCT AGAGCTTCCG GCCATTAGCC ATATTATTCA 2160 

TTGGTTATAT AGCATAAATC AATATTGGCT ATTGGCCATT GCATACGTTG TATCCATATC 22 2 C 

ATAATATGTA CATTTATATT GGCTCATGTC CAACATTACC GCCATGTTGA CATTGATTAT 2 2 80 

TGACTAGTTA TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT 2 34 C 

TCCGCGTTAC ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC 2 4 00 

CATTGACGTC AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC 24 6 0 

GTCAATGGGT GGAGTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA 2 520 

TGCCAAGTAC GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC 2 580 

AGTACATGAC CTTATGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA 264 0 

TTACCATGGT GATGCGGTTT TGGCAGTACA TCAATGGGCG TGGATAGCGG TTTGACTCAC 2 7 00 

GGGGATTTCC AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC 2 76 0 

AACGGGACTT- TCCAAAATGT CGTAACAACT CCGCCCCATT GACGCAAATG GGCGGTAGGC 2 82 0 
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ATGTACGGTG GGAGGTCTAT ATAAGCAGAG CTCGTTTAGT GAACCGTCAG ATCGCCTGGA 28 80 

GACGCCATCC ACGCTGTTTT G AC CTC CAT A GAAGACACCG GGACCGATCC AGCCTCCGCG 2 94 0 

GCCCCAAGCT CATGGTTCTG GCCAAGAGCA TAGTGTGCCT CGCCCTCCTG GCGGTGGCGC 3 0 00 

TAGCCCAGTA CAATCCCAAC ACTCAGGCTG GGAGGACATC TATCGTGCAT CTCTTTGAAT 306 0 

GGCGCTGGGC CGACATTGCA CTGGAGTGCG AACACTATTT AGCTCCTAAT GGGTTTGGAG 312 0 

GAGTTCAGGT TTCTCCTCCA AATGAAAACA TTGTCATTAC TAATCCGAAC AGGCCCTGGT 318 0 

GGGAAAGATA CCAGCCCATC AG CT AC AAG A TCTGCAGTCG ATCGGGCAAT GAAAATGAAT 3 24 0 

TCAGAGACAT GGTGACCAGA TGCAACAATG TTGGAGTTCG TATTTATGTG GATGCTGTTG 33 00 

TCAATCACAT GTGTGGATCT ATGGGTGGCA CGGGCACCCA CTCAACATGT GGGAGCTATT 3 36 0 

TCAACACCGG GACTAGAGAT TTTCCCGCTG TGCCGTACTC TGCCTGGGAT TTCAATGACG 342 0 

GCAAATGTCA CACTGCAAGT GGAGACATCG AAAATTATGG GGACATGTAT CAGGTCCGGG 348 0 

ATTGCAAGTT GTCCAGCCTT CTTGATCTGG CTCTGGAGAA GGACTATGTA CGCTCAACAA 3 54 0 

TTGCAGCGTA CATGAATCAC CTCATTGATA TGGGTGTAGC AGGGTTCCGG ATCGATGCTG 36 00 

CCAAGCATAT GTGGCCAGGG GACATAAGAG CGTTTCTGGA CAAACTGCAC GATCTAAATA 366 0 

CTCAGTGGTT TTCAGCAGGA ACGAAACCCT TTATTTACCA AGAGGTAATT GACTTGGGAG 372 0 

GAGAGCCAAT CACAGGCAGT CAGTACTTTG GGAATGGCCG CGTGACAGAA TTCAAGTATG 3 78 0 

GTGCCAAACT GGGGACGGTG ATCCGGAAGT GGAATGGAGA GAAGATGGCC TACTTAAAGA 3 84 0 

ACTGGGGAGA AGGCTGGGGC TTTGTGCCTT CTGACAGAGC CCTGGTGTTT GTGGATAACC 3 900 

ACGACAACCA GCGGGGGCAC GGGGCAGGCG GAGCTTCCAT TCTTACTTTC TGGGATGCCA 3 96 0 

GGCTTTATAA AATGGCGGTT GGTTTCATGC TCGCTCATCC GTACGGGTTC ACACGGGTGA 4 02 0 

TGTCAAGTTA TCGTTGGCCA AGATATTTCG AAAACGGAGT GGATGTTAAC GACTGGGTGG 4 08 0 

GACCACCAAG TAACTCGGAC GGATCGACGA AGTC CGTTAC AATCAACGCA GACACTACCT 414 0 

GTGGCAATGA CTGGGTCTGC GAACATCGCT ' GGCGACAAAT AAGGAACATG GTTATCTTCC 42 0 0 

GTAATGTGGT AGACGGTCAG CCTTTCTCAA ACTGGTGGGA CAACGGGAGC AATCAAGTAG 42 60 

CTTTCGGTCG CGGCGACAGA GGCTTCATTG TCTTTAATAA TGATGACTGG TATATGAATG 432 0 

TCGATTTGCA AACTGGTCTG CCTGCTGGAA CCTACTGCGA TGTTATTTCT GGACAAAAGG 43 8 0 

AAGGCAGTGC GTGTACTGGA AAGCAGGTGT ACGTTTCCTC GGATGGAAAG GCCAATTTCC 444 0 

AGATTAGTAA CAGCGATGAA GATCCATTTG TTGCAATTCA CGTTGATGCC AAGTTATAAG 4500 
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GCCGGCCGCT TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA 4 56 0 

GAATGCAGTG AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA 462 0 

CCATTATAAG CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG 468 0 

TTCAGGGGGA " GGTGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA 4740 

TCGATAAGGA TCCGTCGACC GATGCCCTTG AGAGCCTTCA ACCCAGTCAG CTCCTTCCGG 480 0 

TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT CATGCAACTC 4 86 0 

GTAGGACAGG TGCCGGCAGC . GCTCTTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT 4 92 0 

CGTTCGGCTG CGGCGAGCGG TATCAG CTC A CTCAAAGGCG GTAATACGGT TATCCACAGA 498 0 

ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG 5 04 0 

TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AG CATC AC AA 5100 

AAATCGACG C TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT 516 0 

TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 52 2 0 

GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT 52 8 0 

CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 5 3 4.0 

CGACCGCTGC GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT 54 0 0 

ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC 54 6 0 

TACAGAGTTC TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT 552 0 

CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA 5 58 0 

ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 564 0 

AAAAGG AT CT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC . AGTGGAACGA 5 7 00 

AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT 57 6 0 

TTTAAATTAA AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA 5 82 0 

CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC 5 88 0 

CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 5 94 0 

CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAG CAAT 6 000 

AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT 6 06 0 

CCAGTCTATT AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG 6120 

CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC 6180 
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ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 6240 

AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC 63 00 

ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT 636 0 

TTCTGTGACT GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGAC CGAG 6420 

TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT 64 80 

GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 654 0 

ATCCAGTTCG ATGTAAC CCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC 66 0 0 

CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGGC 6660 

GACACGGAAA TGTTGAATAC TCATACTCTT C CTTTTTCAA TATTATTGAA GCATTTATCA 6 72 0 

GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG 67 8 0 

GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGCG CCCTGTAGCG GCGCATTAAG 6 84 0 

CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC 6 900 

CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT CGCCACGTTC GCCGGCTTTC CCCGTCAAGC 6 96 0 

TCTAAATCGG GGGCTCCCTT TAGGGTTCCG ATTTAGTGCT TTACGGCACC TCGACCCCAA 7 02 0 

AAAACTTGAT TAGGGTGATG GTTCACGTAG TGGGCCATCG CCCTGATAGA CGGTTTTTCG 7080 

CCCTTTGACG TTGGAGTCCA CGTTCTTTAA TAGTGGACTC TTGTTCCAAA CTGGAACAAC 714 0 

ACTCAACCCT ATCTCGGTCT ATTCTTTTGA TTTATAAGGG ATTTTGCCGA TTTCGGCCTA 72 00 

TTGGTTAAAA AATGAGCTGA TTTAACAAAA ATTTAACGCG AATTTTAACA AAATATTAAC 726 0 

GTTTACAATT TCCCATTCGC CATTCAGGCT GCGCAACTGT TGGGAAGGGC GATCGGTGCG 7320 

GGCCTCTTCG CTATTACGCC AGCCCAAGCT ACCATGATAA GTAAGTAATA TTAAGGTACG 7 3 80 

GGAGGTACTT GGAGCGGCCG CAATAAAATA TCTTTATTTT CATTACATCT GTGTGTTGGT 74 4 0 

TTTTTGTGTG AATCGATAGT ACTAACATAC GCTCTCCATC AAAACAAAAC GAAACAAAAC 7 5 00 

AAACTAGCAA AATAGGCTGT CCCCAGTGCA AGTGCAGGTG C C AG AAC ATT TCTCTATCGA 7 56 0 

TA 7562 
(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7913 base pairs 

(B) TYPE: nucleic acid 

{ C ) STRANDEDNES S : doubl e 
(D) TOPOLOGY: circular 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGTACCGAAA CGCGCGAGGC AGCTGATCTG AGTCACTGAA TGGGATAGTG TCCACAAGGG 6 0 

CGGGGG CT AT TCTTGTCCAT CTGGGCCTAC GGAACCAGCA CCCATCGCCA AACTCTTCAT 12 0 

CCTCTTCCTC AATCTCGCTT TCTCTCTCGT CCGCTTTTTT TTTCTTCTTC TTTTTTTTTT 18 0 

TTTTTTTTTT TTTTTTTTG C AAAAGGAGGG GAGAGGGGGT AAAAAAATGC TGCACTGTGC 24 0 

GGCGAGGCCG GTGAGTGAGC GACGCGGAGC CAATCAGCGC CCGCCGTTCC GAAAGTTGCC 300 

TTTTATGGCT CGAGTGGCCG CTGTGGCGTC CTATAAAACC CGGCGGCGCA ACGCGCAGCC 360 

ACTGTCGAGT CAAGCTTCGA TCGTCG AG CG GCCGCCAGTG TGATGGATAT CTGCAGAATT 42 0 

CGGCTTAGTC GACCCATGGT TCTGGCCAAG AGCATAGTGT GCCTCGCCCT CCTGGCGGTG 48 0 

GCGCTAGCCC AACGCCCAAT TCGACACCAA CTACGCATCC GGTCGTAGTG GAATGGTCCA 54 0 

CCTCTTCGAG TGGAAGTGGG ACGACATCGC TGCCGAGTGC GAAAACTTCC TTGGACCCAA 6 00 

TGGCTACGCC GGTGTTCAGG TCTCCCCTGT GAACGAGAAC GCCGTCAAGG ACAGCCGCCC 66 0' 

CTGGTGGGAA CGTTACCAGC CCATCTCCTA CAAGCTGGAG ACCCGCTCCG GAAACGAAGA 72 0 

GCAGTTCGCC AGCATGGTCA AGCGCTGCAA CGCCGTCGGA GTGCGCACCT ACGTGGACGT 78 0 

GGTCTTCAAC CACATGGCCG CCGACGGAGG CACCTACGGC ACTGGCGGCA GCACCGCCAG 84 0 

CCCCAGCAGC AAGAG CTATC CCGGAGTGCC CTACTCCTCG CTGGACTTCA ACCCGACCTG 90 0 

CGCCATCAGC AACTACAACG ACGCCAACGA GGTGCGCAAC TGCGAGCTGG TCGGTCTGCG 96 0 

CGACCTTAAC CAGGGCAACT CCTACGTGCA GGACAAGGTG GTCGAGTTCC TGGACCATCT 102 0 

GATTGATCTC GGCGTGGCCG GATTCCGCGT GGACGCCGCC AAGCACATGT GGCCCGCCGA 1080 

CCTGGCCGTC ATCTATGGCC GCCTCAAGAA CCTAAACACC GAC CACGGCT TCGCCTCGGG 114 0 

ATCCAAGGCG TACATCGTCC AGGAGGTCAT CGACATGGGC GGCGAGGCCA TCAGCAAGTC 12 00 

CGAGTACACC GGACTGGGCG CCATCACCGA GTTCCGCCAC TCCGACTCCA TCGGCAAGGT 1260 

CTTCCGCGGC AAGGACCAGC TGCAGTACCT GACCAACTGG GGCACCGCCT GGGGCTTCGC 1320 

TGCCTCCGAC CGCTCCCTGG TATTCGTCGA CAACCACGAC AATCAGCGCG G AC ATGG AG C 13 8 0 

AGGAGGCGCC GACGTTCTGA CCTACAAGGT GCCCAAGCAG TACAAGATGG CCTCCGCCTT 14 4 0 

CATGCTGGCG CACCCCTTCG GCACTCCCCG CGTGATGTCC TCCTTCTCCT TCACGGACAC 15 00 
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CGATCAGGGC CCGCCCACCA CCGACGGCCA CAACATCGCC TCGCCCATCT TCAATAGCGA 
CAACTCCTGC AGCGGCGGCT GGGTGTGTGA GCACCGCTGG CGCCAGATCT ACAACATGGT 
GGCCTTCCGA AACACCGTGG GCTCGGACGA GATCCAGAAC TGGTGGGACA ACGGCAGCAA 
CCAGATCTCC TTCAGCCGAG GCAGCCGCGG CTTCGTGGCC TTCAACAACG ACAACTACGA 
CCTGAACAGC TCCCTGCAGA CGGGCCTGCC CGCCGGCACC TACTG CGACG TCATCTCCGG 
CTCCAAGAGC GGTTCCTCCT GCACGGGCAA GACCGTCACC GTCGGATCCG ACGGACGGGC 
TTCCATCAAC ATTGGCAGCT CCGAGGACGA CGGAGTGCTG GCCATTCACG TCAACGCCAA 
GTTGTAAGCT TGTAAACAGC TGGGGAGCAT GGCGAACAGC CAGGCAATTA ATTGAGATTA 
TTAATTGTAC GAAATATATA TGATGAGATT ATAAACACAC AACACTTTTA TTCGCAAGGG 
ATGATAAGAA TCTAATATAT ATATTATCTG GG CTTCAAAG CATTGATTTT ATTTATTGAG 
TCAAGAGGGA AATTTATTTT CTTGTTATTC TCTGTCCAGG TCTAAAGTCC CGAGCGGTGA 
GGCTATCTAT TGATTTGGAC ATTCCAATCG AATACAAAAC AGAGATACAG AAATTTCGGA 
AAAAATTTGA TAACAATCGT GG ATTTT AG C AATTAGACAA ATTGATATGT GCTTGCTAAT 
TGATGTGGCA TGAAATAAGA AATTTATAAG GACGTTTTCA AGTGCTTCTA TTTTAAACAT 
TCAGGATTTT TTTTTAAAGC AGACAGCTTT CAACAGGTTT GATCAGAATT TGAATATTGA 
TTGTTGACTT TAGCTATACA TAAATCACAC CTCATCCACC CATTGTGGTA TCCTTCGAAG 
GACTTGGGAA CTGGATCCTC TAG AG CTTCC GGCCATTAGC CATATTATTC ATTGGTTATA 
TAGCATAAAT CAATATTGGC TATTGGCCAT TGCATACGTT GTATCCATAT CATAATATGT 
ACATTTATAT TGGCTCATGT CCAACATTAC CGCCATGTTG ACATTGATTA TTGACTAGTT 
ATTAATAGTA ATCAATTACG GGGTCATTAG TTCATAGCCC ATATATGGAG TTCCGCGTTA 
CATAACTTAC GGTAAATGGC CCGCCTGGCT GACCGCCCAA CGACCCCCGC C CATTG ACGT 
CAATAATGAC GTATGTTCCC ATAGTAACGC CAATAGGGAC TTTCCATTGA CGTCAATGGG 
TGGAGTATTT ACGGTAAACT GCCCACTTGG CAGTACATCA AGTGTATCAT ATGC CAAGTA 
CGCCCCCTAT TGACGTCAAT GACGGTAAAT GGCCCGCCTG GCATTATGCC CAGTACATGA 
CCTTATGGGA CTTTCCTACT TGGCAGTACA TCTACGTATT AGTCATCGCT ATTACCATGG 
TGATGCGGTT TTGGCAGTAC ATCAATGGGC GTGGATAGCG GTTTGACTCA CGGGGATTTC 
CAAGTCTCCA CCCCATTGAC GTCAATGGGA GTTTGTTTTG GCACCAAAAT CAACGGGACT 
TTCCAAAATG TCGTAACAAC TCCGCCCCAT TGACGCAAAT GGGCGGTAGG CATGTACGGT 
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GGGAGGTCTA TATAAGCAGA GCTCGTTTAG TGAACCGTCA GATCGCCTGG AGACGCCATC 324 0 

CACGCTGTTT TGACCTCCAT AGAAGACACC GGGACCGATC CAGCCTCCGC GGCCCCAAGC 3 3 00 

TCATGGTTCT GGCCAAGAGC ATAGTGTGCC TCGCCCTCCT GGCGGTGGCG CTAGCCCAGT 3 36 0 

ACAATCCCAA CACTCAGGCT GGGAGGACAT CTATCGTGCA TCTCTTTGAA TGGCGCTGGG 3420 

CCGACATTGC ACTGGAGTGC GAACACTATT TAGCTCCTAA TGGGTTTGGA GGAGTTCAGG 3480 

TTTCTCCTCC AAATGAAAAC ATTGTCATTA CTAATCCGAA CAGGCCCTGG TGGGAAAGAT 3 54 0 

ACCAGCCCAT C AG CTACAAG ATCTGCAGTC GATCGGGCAA TGAAAATGAA TTCAGAGACA 360 0 

TGGTGACCAG ATGCAACAAT GTTGGAGTTC GTATTTATGT GGATGCTGTT GTCAATCACA 366 0 

TGTGTGGATC TATGGGTGGC ACGGGCACCC ACTCAACATG TGGGAGCTAT TTCAACACCG 3 720 

GGACTAGAGA TTTTCCCGCT GTGCCGTACT CTGCCTGGGA TTTCAATGAC GGCAAATGTC 3 780 

ACACTGCAAG TGGAGACATC GAAAATTATG GGGACATGTA TCAGGTCCGG GATTGCAAGT 3 84 0 

TGTCCAGCCT TCTTGATCTG GCTCTGGAGA AGGACTATGT ACGCTCAACA ATTGCAGCGT 3 9 00 

ACATGAATCA C CTCATTG AT ATGGGTGTAG CAGGGTTCCG GATCGATGCT GCCAAGCATA 3360 

TGTGGCCAGG GGACATAAGA GCGTTTCTGG ACAAACTGCA CGATCTAAAT ACTCAGTGGT 4.02 0 

TTTCAGCAGG AACGAAACCC TTTATTTACC AAGAGGTAAT TGACTTGGGA GGAGAGCCAA 4 0 80 

TCACAGGCAG TCAGTACTTT GGGAATGGCC GCGTGACAGA ATTCAAGTAT GGTGCCAAAC 4.14 0 

TGGGGACGGT GATCCGGAAG TGGAATGGAG AGAAGATGGC CTACTTAAAG AACTGGGGAG 4 2 00 

AAGGCTGGGG CTTTGTGCCT TCTGACAGAG CCCTGGTGTT TGTGGATAAC CACGACAACC 4260 

AGCGGGGGCA CGGGGCAGGC GGAGCTTCCA TTCTTACTTT CTGGGATGCC AGGCTTTATA 43 2 0 

AAATGGCGGT TGGTTTCATG CTCGCTCATC CGTACGGGTT CACACGGGTG ATGTCAAGTT 4 380 

ATCGTTGGCC AAGATATTTC GAAAACGGAG TGGATGTTAA CGACTGGGTG GGACCACCAA 444 0 

GTAACTCGGA CGGATCGACG AAGTCCGTTA CAATCAACGC AGAC ACT AC C TGTGGCAATG 4 500 

ACTGGGTCTG CGAACATCGC TGGCGACAAA TAAGGAACAT GGTTATCTTC CGTAATGTGG 4 56 0 

TAGACGGTCA GCCTTTCTCA AACTGGTGGG ACAACGGGAG CAATCAAGTA GCTTTCGGTC 462 0 

GCGGCGACAG AGGCTTCATT. GTCTTTAATA ATGATG AC TG GTATATGAAT GTCGATTTGC 4 6 80 

AAACTGGTCT GCCTGCTGGA ACCTACTGCG ATGTTATTTC TGGACAAAAG . GAAGGCAGTG 4 74 0 

CGTGTACTGG AAAGCAGGTG TACGTTTCCT CGGATGGAAA GGCCAATTTC CAGATTAGTA ' 4800 
ACAGCGATGA AGATCCATTT. GTTGCAATTC ACGTTGATGC CAAGTTATAA GGCCGGCCGC . 4 860 
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TTCGAGCAGA CATGATAAGA TACATTGATG AGTTTGGACA AACCACAACT AGAATGCAGT 4 92 0 

GAAAAAAATG CTTTATTTGT GAAATTTGTG ATG CTATTGC TTTATTTGTA ACCATTATAA 4 98 0 

GCTGCAATAA ACAAGTTAAC AACAACAATT GCATTCATTT TATGTTTCAG GTTCAGGGGG 504 0 

AGGTGTGGGA GGTTTTTTAA AGCAAGTAAA ACCTCTACAA ATGTGGTAAA ATCGATAAGG 5100 

ATC CGTCGAC CGATGCCCTT GAGAGCCTTC AACCCAGTCA GCTCCTTCCG GTGGGCGCGG 516 0 

GGCATGACTA TCGTCGCCGC ACTTATGACT GTCTTCTTTA TCATGCAACT CGTAGGACAG 522 0 

GTGCCGGCAG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG TCGTTCGGCT 52 8 0 

GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATC C AC AG AATCAGGGGA 534 0 

TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC GTAAAAAGGC 54 00 

CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA AAAATCGACG 54 6 0 

CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT TTCCCCCTGG .5520 

AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC TGTCCGCCTT 558 0 

TCTCCCTTCG GGAAG CGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC TCAGTTCGGT 564 0 

GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC CCGACCGCTG 5 7 00 

CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT TATCGCCACT 57 6 0 

GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG CTACAGAGTT 5 82 0 

CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA TCTGCGCTCT 588 0 

GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA AACAAACCAC 5 94 0 

CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA AAAAAGGATC 6 0 00 

TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG AAAACTCACG 6 06 0 

TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC AC CTAGATC C TTTTAAATTA 612 0 

AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG ACAGTTACCA 6180 

ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT C CATAGTTG C 6 24 0 

CTGACTCCCC GT CGTGTAG A TAACTACGAT ACGGGAGGGC TTACCATCTG GCCCCAGTGC 6 3 00 

TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA TAAACCAGCC 636 0 

AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA TCCAGTCTAT 642 0 

TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC GCAACGTTGT 64 80 

TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT CATTCAGCTC 6 54 0 
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CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCGCCATG TTGTGCAAAA AAGCGGTTAG 6 6 00 

CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT CACTCATGGT 6 660 

TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT TTTCTGTGAC 6 72 0 

TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA GTTGCTCTTG 6 780 

CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG TGCTCATCAT 6 84 0 

TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA GATCCAGTTC 6 900 

GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA CCAGCGTTTC 6 96 0 

TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG CGACACGGAA 7 02 0 

ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC AGGGTTATTG 7 08 0 

TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG GGGTTCCGCG 714 0 

CACATTTCCC CGAAAAGTGC CACCTGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG 72 0 0 

TGTGGTGGTT ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT 72 6 0 

CGCTTTCTTC CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG 732 0 

GGGGCTCCCT TTAGGGTTCC GATTTAGTGC TTTACGGCAC CTCGACCCCA AAAAACTTGA 73 8 0' 

TTAGGGTGAT GGTTCACGTA GTGGGCCATC GCCCTGATAG ACGGTTTTTC GCCCTTTGAC 74 4 0 

GTTGGAGTCC ACGTTCTTTA ATAGTGGACT CTTGTTCCAA ACTGGAACAA CACTCAACCC 7 5 00 

TATCTCGGTC TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT ATTGGTTAAA 7 5 60 

AAATGAG CTG ATTTAACAAA AATTTAACGC GAATTTTAAC AAAATATTAA CGTTTACAAT 76 2 0 

TTCCCATTCG CCATTCAGGC TGCGCAACTG TTGGGAAGGG CGATCGGTGC GGGCCTCTTC 76 8 0 

GCTATTACGC CAGCCCAAGC TACCATGATA AGTAAGTAAT ATTAAGGTAC GGGAGGTACT 7 74 0 

TGGAGCGGCC GCAATAAAAT ATCTTTATTT TCATTACATC TGTGTGTTGG TTTTTTGTGT 7 8 00 

GAATCGATAG TACTAACATA CGCTCTCCAT CAAAACAAAA CGAAACAAAA CAAACTAGCA 7 86 0 

AAATAGGCTG TCCCCAGTGC AAGTGCAGGT GCCAGAACAT TTCTCTATCG ATA 7 913 
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THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE 
PROPERTY OF PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS: 

1 . An extra-cellular reporter gene system for use in a target cell, comprising a DNA 
sequence encoding: 

a signal peptide; 

at least one a-amylase protein; and 
a transcription termination region. 

2. The reporter gene system of claim 1, wherein the a-amylase protein is 
electrophoretically distinct from any endogenous amylase in the target cell and wherein 
the a-amylase protein is compatible with the target cells expression and secretion 
systems. 

3. The reporter gene system of claim 2, wherein the reporter gene system 
comprises DNA sequences encoding two a-amylase proteins. 

4. The reporter gene system of claim 3, wherein the a-amylase proteins are 
electrophoretically distinct from each other. 

5. The reporter gene system of claim 2, wherein the a-amylase protein is selected 
from the group consisting of chicken a-amylase and Drosophila melanogaster a- 
amylase. 

6. The reporter gene system of Claim 5, wherein the signal peptide is derived 
from a D. melanogaster a-amylase gene. 

7. The reporter gene system of Claim 6, wherein the signal peptide is encoded by 
the DNA sequence of SEQID No: 5. 
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8. The reporter gene system of Claim 7, wherein the chicken a-amylase protein 
is encoded by the DNA sequence of SEQID No: 1 . 

9. The reporter gene system of Claim 7, wherein the D. melanogaster a-amylase 
protein is encoded by the DNA sequence of SEQID No: 8. 

10. The reporter gene system of Claim 7, wherein the transcription termination 
region is derived from the group consisting of a D. melanogaster a-amylase gene and 
a late polyadenylation site of SV40. 

11. The reporter gene system of Claim 10, wherein the transcription termination 
region is derived from the D. melanogaster a-amylase gene and is encoded by the 
DNA sequence of SEQID No: 2. 

12. The reporter gene system of Claim 1, wherein said system further comprises 
a promoter region. 

13. The reporter gene system of Claim 12, wherein the promoter region is selected 
from the group consisting of: a Cytomegalovirus promoter, a murine retroviral 
promoter and a rat P-actin promoter. 

14. The reporter gene system of claim 12, wherein the a-amylase protein is 
electrophoretically distinct from any endogenous amylase in the target cell and wherein 
the a-amylase protein is compatible with the target cells expression and secretion 
systems. 

15. The reporter gene system of claim 14, wherein the reporter gene system 
comprises DNA sequences encoding two a-amylase proteins. 

16. The reporter gene system of claim 15, wherein the a-amylase proteins are 
electrophoretically distinct from each other. 
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17. The reporter gene system of claim 14, wherein the a-amylase protein is selected 
from the group consisting of chicken a-amylase and Drosophila melanogaster a- 
amylase. 

18. The reporter gene system of Claim 17, wherein the signal peptide is derived 
from a D. melanogaster a-amylase gene. 

19. The reporter gene system of Claim 18, wherein the signal peptide is encoded 
by the DNA sequence of SEQID No: 5. 

20. The reporter gene system of Claim 17, wherein the chicken a-amylase protein 
is encoded by the DNA sequence of SEQID No: 1. 

21 . The reporter gene system of Claim 17, wherein the D. melanogaster a-amylase 
protein is encoded by the DNA sequence of SEQID No: 8. 

22. The reporter gene system of Claim 17, wherein the transcription termination 
region is derived from the group consisting of a D. melanogaster a-amylase gene and 
a late polyadenylation site of SV40. 

23. The reporter gene system of Claim 22, wherein the transcription termination 
region is derived from the D. melanogaster a-amylase gene and is encoded by the 
DNA sequence of SEQID No: 2. 

24. The reporter gene system of Claim 11, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 
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25. The reporter gene system of Claim 23, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a Cytomegalovirus promoter, 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

26. The reporter gene system of Claim 11, wherein the reporter gene system 
comprising a DNA sequence encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene: 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

27. The reporter gene system of Claim 23, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

a Cytomegalovirus promoter, 

a signal peptide derived from a Drosophila melanogaster a-amylase gene: 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene. 

28. The reporter gene system of Claim 16, wherein the extra-cellular reporter gene 
system comprising a DNA sequence encoding: 

two expression cassettes, wherein the first cassette comprises a DNA sequence 
encoding: 

a signal peptide derived from a Drosophila melanogaster a-amylase gene; 
a Drosophila a-amylase protein; and 

a transcription termination region derived from a Drosophila melanogaster a- 
amylase gene; 
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and wherein said second cassette comprises a DNA sequence encoding: 
a Cytomegalovirus promoter, 

a signal peptide derived from a D. melanogaster a-amylase gene; 
a chicken a-amylase protein; and 

a transcription termination region derived from a late polyadenylation site of 
SV40; 

wherein the first cassette further comprises a polylinker region immediately 3' of the 
D. melanogaster a-amylase gene to facilitate insertion of a promoter sequence of 
interest to determine the strength of said promoter. 

29. The reporter gene system of any of the preceding claims were the target cell is 
a mammalian cell. 

30. A kit comprising the reporter gene system of any of the preceding claims, a 
means of detecting the amylase activity and instructions for use. 

31. The kit of claim 30 farther comprising a beta-dextrin system to remove non- 
specific proteins from high-protein fluids. 

32. The kit of claim 30 or 31, wherein the means of detecting the amylase system 
is selected from the group consisting of a plate assay and a gel electrophoresis means. 
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5-GCTAGCTCAG TAC AATCCCA ACACTCAGGC TGGG AGGACA TCTATCGTGC ATCTCTTTG A 
ATGGCGCTGG GCCG AC ATTG CACTGGAGTG CGAACACTAT TTAGCTCCTA ATGGGTTTGG 
AGGAGTTCAG GTTTCTCCTC CAAATGAAAA CATTGTCATT ACTAATCCGA AC AG GCCCTG 
GTGGGAAAGA TACCAGCCCA TCAGCTACAA GATCTGCAGT CG ATCGGGCA ATGAAAATGA 
ATTCAGAGAC ATGGTGACCA GATGCAACAA TGTTGGAGTT CGTATTTATG TGGATGCTGT 
TGTCAATCAC ATGTGTGGAT CTATGGGTGG CACGGGCACC CACTCAACAT GTGGGAGCTA 
TTTCAACACC GGGACTAGAG ATTTTCCCGC TGTGCCGTAC TCTGCCTGGG ATTTCAATGA 
CGGCAAATGT CACACTGCAA GTGGAG ACAT CGAAAATTAT GGGG ACATGT ATCAGGTCCG 
GGATTGCAAG TTGTCCAGCC TTCTTGATCT GGCTCTGGAG AAGGACTATG TACGCTCAAC 
AATTGCAGCG TACATGAATC ACCTC ATTG A TATGGGTGTA GCAGGGTTCC GGATCG ATGC 
TGCCAAGCAT ATGTGGCCAG GGGACATAAG AGCGTTTCTG GACAAACTGC ACGATCTA AA 
TACTCAGTGG TTTTCAGCAG GAACGAAACC CTTTATTTAC CAAGAGGTAA TTGACTTGGG 
AGGAGAGCCA ATCACAGGCA GTCAGTACTT TGGG AATGGC CGCGTGACAG AATTCAAGTA 
TGGTGCCAAA CTGGGG ACGG TGATCCGGAA GTGGAATGGA G AG AAGATGG CCTACTTAA A 
GAACTGGGG A GAAGGCTGGG GCTTTGTGCC TTCTG AC AG A GCCCTG GTGT TTGTGGATAA 
CCACGACAAC CAGCGGGGGC ACGGGGCAGG CGGAGCTTCC ATTCTTACTT TCTGGGATGC 
CAGGCTTTAT AAAATGGCGG TTGGTTTCAT GCTCGCTCAT CCGTACGGGT TCACACGGGT 
GATGTCAAGT TATCGTTGGC CAAGATATTT CGAAAACGGA GTGGATGTTA ACGACTGGGT 
GGGACCACCA AGTAACTCGG ACGGATCGAC GAAGTCCGTT ACAATCAACG CAG ACACTAC 
CTGTGGCAAT GACTGGGTCT GCGAACATCG CTGGCGACAA ATAAGGAACA TGGTTATCTT 
CCGTAATGTG GTAGACGGTC AGCCTTTCTC AAACTGGTGG GACAACGGGA GCAATCA AGT 
AGCTTTCGGT CGCGGCGACA GAGGCTTCAT TGTCTTTAAT AATGATGACT GGTATATGAA 
TGTCGATTTG CAAACTGGTC TGCCTGCTGG AACCTACTGC GATGTTATTT CTGGACA AAA 
GGAAGGCAGT GCGTGTACTG GAAAGCAGGT GTACGTTTCC TCGGATGG AA AGGCCA ATTT 
CCAG ATTAGT AACAGCG ATG AAG ATCCATT TGTTGCAATT CACGTTGATG CCAAGTTATA 
AGCTT-3' 
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5 -AAGCTTGT AA ACAGCTGGGG AGCATGGCGA ACAGCCAGGC AATTAATTGA GATTATTAAT 
TGTACGAAAT ATATATGATG AGATTATAAA CACACAACAC TTTTATTCGC AAGGGATGAT 
AAGAATCTAA TATATATATT ATCTGGGCTT CAAAGCATTG ATTTTATTTA TTGAGTCAAG 
AGGGAAATTT ATTTTCTTGT TATTCTCTGT CCAGGTCTAA AGTCCCGAGC GGTGAGGCTA 
TCTATTGATT TGGACATTCC AATCGAATAC AAAACAGAGA TACAGAAATT TGCGAAAAAA 
TTTGATAACA ATCGTGGATT TTACGAATTA GACAAATTGA TATGTGCTTG CTAATTGATG 
TGGCATGAAA TAAGAAATTT ATAAGGACGT TTTCAAGTGC TTCTATTTTA AACATTCAGG 
ATTTTTTTTT AAAGCAGACA GCTTTCAACA GGTTTGATGA GAATTTGAAT ATTGATTGTT 

GACTTTAGCT ATACATAAAT CACACCTCAT CCACCCATTG TGGTATCCTT CGAAGGACTT 

o 

GGGAACTGGA TCCTCTAGAA GGCGCGCC-3' 
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